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Replication-Competent Anti-Cancer Vectors 
Reference to Government Grant 

This invention was made with government support under a grant from the National 
Institutes of Health, Grant Number ROl CA71704 and CA81829. The United States 
Government has certain rights in this invention. 
5 Background of the Invention 
(1) Field of the Invention 

This invention relates generally to the treatment of cancer and more particularly to 
vectors which replicate in neoplastic cells and which overexpress an adenovirus death protein 
(ADP) and to the use of these vectors in treating human cancer. 
1 0 (2) Description of the Related Art 

Cancer is a leading cause of death in the United States and elsewhere. Depending on 
the type of cancer, it is typically treated with surgery, chemotherapy, and/or radiation. These 
treatments often fail: surgery may not remove all the cancer; some cancers are resistant to 
chemotherapy and radiation therapy; and chemotherapy-resistant tumors frequently develop. 
1 5 New therapies are necessary, to be used alone or in combination with classical techniques. 

One potential therapy under active investigation is treating tumors with recombinant 
viral vectors expressing anti-cancer therapeutic proteins. Adenovirus-based vectors contain 
several characteristics that make them conceptually appealing for use in treating cancer, as 
well as for therapy of genetic disorders. Adenoviruses (hereinafter used interchangeably with 
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"Ads") can easily be grown in culture to high titer stocks that are stable. They have a broad 
host range, replicating in most human cancer cell types. Their genome can be manipulated by 
site-directed mutation and insertion of foreign genes expressed from foreign promoters. 

The adenovirion consists of a DNA-protein core within a protein capsid (reviewed by 
Stewart et al., "Adenovirus structure by x-ray crystallography and electron microscopy." in: 
The Molecular Repertoire of Adenoviruses, Doerfler, W. et al., (ed)., Springer- Verlag, 
Heidelberg, Germany, p. 25-38). Virions bind to a specific cellular receptor, are endocytosed, 
and the genome is extruded from endosomes and transported to the nucleus. The genome is a 
linear duplex DNA of about 36 kbp, encoding about 36 genes (Fig. 1A). In the nucleus, the 
"immediate early" El A proteins are expressed initially, and these proteins induce expression 
of the "delayed early" proteins encoded by the E1B, E2, E3, and E4 transcription units 
(reviewed by Shenk, T. "Adenoviridae: the viruses and their replication" in: Fields Virology, 
Field, B.N. et al., Lippencott-Raven, Philadelphia, p. 21 1 1-2148). El A proteins also induce 
or repress cellular genes, resulting in stimulation of the cell cycle. About 23 early proteins 
function to usurp the cell and initiate viral DNA replication. Viral DNA replicates at about 7 
h post-infection (p.i.), then late genes are expressed from the "major late" transcription unit. 
Major late mRNAs are synthesized from the common "major late promoter" by alternative 
pre-mRNA processing. Each late mRNA contains a common "tripartite leader" at its 5'- 
terminus (exons 1, 2, and 3 in Fig. 1), which allows for efficient translation of Ad late 
mRNAs. Cellular protein synthesis is shut off, and the cell becomes a factory for making 
viral proteins. Virions assemble in the nucleus at about 1 day p.i., and after 2-3 days the cell 
lyses and releases progeny virus. Cell lysis is mediated by the E3 1 1 .6K protein, which has 
been renamed "adenovirus death protein" (ADP) (Tollefson et al., J. Virol. 70:2296-2306, 
1996; Tollefson et al., Virol. 220:152-162, 1996). The term ADP as used herein in a generic 
sense refers collectively to ADP's from adenoviruses such as, e.g. Ad type 1 (Adl), Ad type 2 
(Ad2), Ad type 5 (Ad5) or Ad type 6 (Ad6) all of which express homologous ADP's with a 
high degree of sequence similarity. 

Human adenovirus type 5 (Ad5) is particularly useful for cancer gene therapy. It 
primarily causes asymptomatic or mild respiratory infections in young children, followed by 
long term effective immunity. Fatalities are extremely rare except when the patient is 
immunocompromised (Horwitz, M. S., Adenoviruses, p. 2149-2171 In B. N. Fields, D. M. 
Knipe, and P. M. Howley (eds.), Fields Virology, Lippincott-Raven Publishers, Philadelphia, 
PA, 1996). Ad5 is very well understood, can be grown in culture to high titer stocks that are 
stable, and can replicate in most human cancer cell types (Shenk, T., Adenoviridae: the 
viruses and their replication, p. 21 1 1-2148. In B. N. Fields, D. M. Knipe, and P. M. Howley 
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(eds.), Fields Virology, Lippincott-Raven, Philadelphia, 1996). Its genome can be 
manipulated by site-directed mutagenesis and insertion of foreign sequences. 

The Ad vectors being investigated for use in anti-cancer and gene therapy are based 
on recombinant Ad's that are either replication-defective or replication-competent. Typical 
5 replication-defective Ad vectors lack the E1A and E1B genes (collectively known as El) and 
contain in their place an expression cassette consisting of a promoter and pre-mRNA 
processing signals which drive expression of a foreign gene. The E 1 A proteins induce 
transcription of other Ad genes, and in nontransformed cells they deregulate the cell cycle, 
induce or repress a variety of cellular genes, and force cells from G 0 into S-phase 48 (White, 

1 0 E, Semin. Virol. 5:505-5 1 3, 1998; Wold et al., pp. 200-232 In A J. Cann (ed.), DNA Virus 
Replication: Frontiers in Molecular Biology, Oxford University Press, Oxford). The E1B 
proteins inhibit cellular apoptosis. Id. These vectors are unable to replicate because they lack 
the El A genes required to induce Ad gene expression and DNA replication. In addition, the 
E3 genes are usually deleted because they are not essential for virus replication in cultured 

15 cells. 

A number of investigators have constructed replication-defective Ad vectors 
expressing anti-cancer therapeutic proteins. Usually, these vectors have been tested by direct 
injection of human tumors growing in mouse models. Most commonly, these vectors express 
the thymidine kinase gene from herpes simplex virus, and the mice are treated with 

20 gancyclovir to kill cells transduced by the vector (see e.g., Felzmann et al., Gene Ther. 

4: 1322-1329, 1997). Another suicide gene therapy approach involves injecting tumors with a 
replication defective Ad vector expressing cytosine deaminase, followed by administration of 
5-fluorocytosine (Topf et al., Gene Ther. 5:507-513, 1998). Investigators have also prepared 
and tested replication-defective Ad vectors expressing a cytokine-such as IL-2, IL-12, IL-6, 

25 tumor necrosis factor (TNF), type I interferons, or the co-stimulatory molecule B7-1 in the 
anticipation that the Ad-expressed cytokine will stimulate an immune response, including 
cytotoxic T-lymphocytes (CTL), against the tumor (Felzmann et al., supra; Putzer et al., Proc. 
Natl. Acad. Sci. USA 94: 10889-10894, 1997). Other vectors express tumor antigens (e.g. 
melanoma MARTI), proteins that de-regulate the cell cycle and induce apoptosis (p53, pRB, 

30 p 2i Ki » 1/WAF ', P 16 CDKN2 , and even Ad El A), and ribozymes. An Ad vector expressing FasL 
induces apoptosis and tumor regression of a mouse tumor (Arai et al., Proc. Natl. Acad. Sci. 
USA 94: 13862-13867, 1997). 

Despite these generally positive reports, it is recognized in the art that 
replication-defective Ad vectors have several characteristics that make them suboptimal for 

35 use in therapy. For example, production of replication-defective vectors requires that they be 
grown on a complementing cell line that provides the El A proteins in trans. Such cell lines 
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are fastidious, and generation of virus stocks is time-consuming and expensive. In addition, 
although many foreign proteins have been expressed from such vectors, the level of 
expression is low compared to Ad late proteins. 

To address these problems, several groups have proposed using replication- 
competent Ad vectors for therapeutic use. Replication-competent vectors retain Ad genes 
essential for replication and thus do not require complementing cell lines to replicate. 
Replication-competent Ad vectors lyse cells as a natural part of the life cycle of the vector. 
Another advantage of replication-competent Ad vectors occurs when the vector is engineered 
to encode and express a foreign protein. Such vectors would be expected to greatly amplify 
synthesis of the encoded protein in vivo as the vector replicates. However, in order to prevent 
RC vectors from damaging normal tissues and causing disseminated viremia, it is important 
that they have some feature that limits their replication to cancer cells. 

Wyeth Laboratories developed replication-competent Ad vectors for vaccination 
purposes, using vaccine strains of Ad serotypes 4, 7, and 5 (Lubeck etal., AIDS Res. Hum. 
Retroviruses 10: 1443-1449, 1994). Foreign genes were inserted into the E3 region (with the 
E3 genes deleted) or into a site at the right end of the genome. Two foreign genes used were 
hepatitis B surface antigen and the HTV envelope protein. They obtained good expression in 
culture, and were able to raise antisera in animal models. Phase I human trials were 
ambiguous, and the project was mostly abandoned. 

Onyx Pharmaceuticals recently reported on adenovirus-based anti-cancer vectors 
which are replication deficient in non-neoplastic cells but which exhibit a replication 
phenotype in neoplastic cells lacking functional p53 and/or retinoblastoma (pRB) tumor 
suppressor proteins (U.S. Patent No. 5,677, 1 78; Heise et al., Nature Med. 5:639-645, 1 997; 
Bischoff et al., Science 274:373-316, 1996). This phenotype is reportedly accomplished by 
using recombinant adenoviruses containing a mutation in the E1B region that make the 
encoded E1B-55K protein incapable of binding to p53 and/or a mutation(s) in the E1A region 
which make the encoded El A protein ( P 289R or p243R) incapable of binding to pRB and/or 
the cellular 300 kD polypeptide and/or the 107 kD polypeptide. E1B-55K has at least two 
independent functions: it binds and inactivates the tumor suppressor protein p53, and it is 
required for efficient transport of Ad mRNA from the nucleus. Because these E1B and E1A 
viral proteins are involved in forcing cells into S-phase, which is required for replication of 
adenovirus DNA, and because the p53 and pRB proteins block cell cycle progression, the 
recombinant adenovirus vectors described by Onyx should replicate in cells defective in p53 
and/or pRB, which is the case for many cancer cells, but not in cells with wild-type P 53 
and/or pRB. Onyx has reported that replication of an adenovirus lacking E1B-55K, which is 
named ONYX-015, was restricted to p53-minus cancer cell lines (Bischoff et al., supra), and 
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that ONYX-015 slowed the growth or caused regression of a p53-minus human tumor 
growing in nude mice (Heise et al., supra). Others have challenged the Onyx report claiming 
that replication of ONYX-015 is independent of p53 genotype and occurs efficiently in some 
primary cultured human cells (Harada and Berk, J. Virol 75:5333-5344, 1999). It is now 
known that ONYX-015 can replicate in cells with wild-type p53 (Goodrum et al., J. Virol. 
72:9479-9490, 1998; Harada et al., J. Virol. 75:5333-5344, 1999; Hay et al., Hum. Gene Ther. 
70:579-590, 1999;Rothmannetal.,y. Virol. 72:9470-9478, 1998; Turnell et al., J. Virol. 
75:2074-2083, 1999). ONYX-015 does not replicate as well as wild-type adenovirus because 
E1B-55K is not available to facilitate viral mRNA transport from the nucleus. Also, ONYX- 
015 expresses less ADP than wild-type virus (see Example 1 below). 

As an extension of the ONYX-015 concept, a replication-competent adenovirus 
vector was designed that has the gene for E1B-55K replaced with the herpes simplex virus 
thymidine kinase gene (Wilder et al., Gene Therapy 5:57-62, 1999). The group that 
constructed this vector reported that the combination of the vector plus gancyclovir showed a 
therapeutic effect on a human colon cancer in a nude mouse model (Wilder et al., Cancer Res. 
59:410-413, 1999). However, this vector lacks the gene for ADP, and accordingly, the vector 
will lyse cells and spread from cell-to-cell less efficiently than an equivalent vector that 
expresses ADP. The gene for ADP is also lacking in another replication-competent 
adenovirus vector that has been described, in which a minimal enhancer/promoter of the 
human prostate specific antigen was inserted into the adenovirus El A enhancer/promoter 
(Rodriguez et al., Cancer Res. 57:2559-2563, 1997). 

Another strategy for replication-competent vector improvement is to place replication 
under the control of tissue-specific promoters. One group replaced the basal El A promoter 
with a modified promoter for a-fetoprotein (AFP) (Hallenbeck et al., Hum. Gene Ther. 
70:1721-1733, 1999). AFP is expressed in the liver during development, but it is not 
expressed in adults. However, it is expressed in 70-80% of patients with hepatocellular 
carcinoma. Growth of this vector was limited to AFP-expressing cells and the vector showed 
some suppression of xenotransplants. Id. A series of RC vectors has also been developed 
that have expression of the E1A and E1B genes dependent on the prostate tumor-specific 
prostate specific antigen (PSA) and kallikrein promoters/enhancers (Rodriguez et al., Cancer 
Res. 60:1196, 1997; Yu et al., Cancer Ties. 59:4200-4203, 2000; Yu et al., Cancer Res 
59:1498-1504, 1999). 

Thus, there is a continuing need for vectors that replicate and spread efficiently in 
tumors but that can be modified such that they replicate poorly or not at all in normal tissue. 
Summary of the Invention 
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Briefly, therefore, the present invention is directed to novel vectors which are 
replication competent in neoplastic cells and which overexpress an adenovirus death protein 
(ADP). The work reported herein demonstrates the discovery that overexpression of ADP by 
a recombinant adenovirus allows the construction of a replication-competent adenovirus that 
kills neoplastic cells and spreads from cell-to-cell at a rate similar to or faster than that 
exhibited by adenoviruses expressing wild-type levels of ADP, even when the recombinant 
adenovirus contains a mutation that would otherwise reduce its replication rate in non- 
neoplastic cells. This discovery was unexpected because it could not have been predicted 
from what was known about adenovirus biology that Ad vectors overexpressing ADP remain 
viable and that the infected cells are not killed by the higher amounts of ADP before the Ad 
vector produces new virus particles that can spread to other tumor cells. Indeed, naturally- 
occurring adenoviruses express ADP in low amounts from the E3 promoter at early stages of 
infection, and begin to make ADP in large amounts only at 24-30 h p.i, once virions have 
been assembled in the cell nucleus. It is believed that other non-adenoviral vectors can be 
used to deliver ADP's cell-killing activity to neoplastic cells, including other viral vectors and 
plasmid expression vectors. 

Thus, in one preferred embodiment, the ADP-expressing vector comprises a 
recombinant adenovirus lacking expression of at least one E3 protein selected from the group 
consisting of: gpl9K; RIDa (also known as 10.4K); RIDp (also known as 14.5K) and 14.7K. 
Because these E3 proteins inhibit immune-mediated inflammation and/or apoptosis of Ad- 
infected cells, it is believed that a recombinant adenovirus lacking one or more of these E3 
proteins will stimulate infiltration of inflammatory and immune cells into a tumor treated with 
the adenovirus and that this host immune response will aid in destruction of the tumor as well 
as tumors that have metastasized. The ADP expressed by preferred embodiments comprises a 
naturally-occurring amino acid sequence from a human adenovirus of subgroup C, namely 
Adl,Ad2,Ad5andAd6. 

In another embodiment, replication of the vector is restricted to neoplastic cells. Such 
replication-restricted vectors are useful in treating cancer patients in which it is desirable to 
eliminate or reduce damage to norma] cells and tissues that might be caused by the vector, 
particularly viral vectors that kill the host cell as part of their life cycle. In preferred 
embodiments, a recombinant adenovirus has a replication-restricted phenotype because the 
recombinant adenovirus is incapable of expressing an El A viral protein which binds the pRB 
and the P 300/CBP proteins or because the E4 promoter has been substituted with a promoter 
that is activated only in neoplastic cells and/or cells of a specific tissue. 

In yet another embodiment, the invention provides a vector which overexpresses ADP 
and whose replication is under the control of a tissue specific promoter, tumor specific 
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promoter or an inducible promoter. In preferred embodiments, the vector c< 
recombinant adenovirus in which the tissue specific promoter or inducible promoter is 
substituted for the E4 promoter. Such vectors are useful for restricting replication of the 
vector and its ADP-mediated cell killing to cells of a particular type or to cells exposed to an 
5 exogenous agent that activates the promoter. A preferred tissue-specific or inducible vector 
also expresses a phenotype that restricts its replication to neoplastic cells. 

In yet another embodiment, the invention provides a vector which overexpresses ADP 
but which is not restricted to tumors by a specific genetic modification. Such a vector is more 
destructive to neoplastic cells than even the naturally occurring Ad's of subgroup C. In 
) preferred embodiments, this vector could be used for patients with terminal cancer not 
treatable by another method, and who have pre-existing neutralizing antibodies to Ad or to 
which neutralizing antibodies can be administered. 

In still another embodiment, the invention provides a composition comprising a first 
recombinant virus which is replication competent in a neoplastic cell and overexpresses the 
adenovirus death protein. In one embodiment, the recombinant virus is contained within a 
delivery vehicle comprising a targeting moiety that limits delivery of the virus to cells of a 
certain type. With this embodiment, the replication-competent vector can be of any ADP- 
overexpressing configuration described herein. In some embodiments, the composition also 
comprises a second recombinant vrnis which is replication-defective and which expresses an 
anti-cancer gene product In some embodiments, the replication-defective vector may be 
engineered to overexpress ADP when replication of this vector is complemented by a 
replication-competent vector. The recombinant virus complements spread of the replication- 
defective virus, as well as its encoded anti-cancer product, throughout a tumor. In preferred 
embodiments, the first recombinant virus is a recombinant adenovirus whose replication is 
restricted to neoplastic cells and/or which lacks expression of one or more of the E3 gpl9K; 
RIDa; RIDP; and 14.7K proteins. 

In additional embodiments, the invention provides replication-competent vectors that 
overexpresses an ADP and also expresses an anti-cancer product. As with previous 
embodiments, the vector can be of any ADP-overexpressing configuration provided herein. 
Preferably, replication of the virus is engineered to (a) be restricted to neoplastic cells, e.g., by 
replacing the E4 promoter with a tissue specific or tumor specific promoter and/or (b) lack' 
expression of one or more of the E3 gpl9K; RIDa; RIDP; and 14.7K proteins. In some 
embodiments, the anti-cancer product is inserted into the E3 region. 

The ADP-expressing vectors and compositions of the invention are useful in a 
method for promoting death of a neoplastic cell. The method comprises contacting the 
neoplastic cell with a vector which is replication-competent in the neoplastic cell and which 
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overexpresses ADP. Where the neoplastic cell comprises a tumor in a patient, the vector is 
administered directly to the tumor or, in other embodiments, the vector is administered to the 
patient systemically or in a delivery vehicle containing a targeting moiety that directs delivery 
of the vector to the tumor. In embodiments where the vector is a recombinant virus, the 
5 method can also comprise passively immunizing the patient against the virus. 

In yet another embodiment of the invention, the vector may be used in combination 
with radiation therapy. The radiation therapy can be any form of radiation therapy used in the 
art such as for example, external beam radiation such as x-ray treatment, radiation delivered 
by insertion of radioactive materials within the body near or at the tumor site such as 
1 0 treatment with gamma ray emitting radionuclides, particle beam therapy which utilizes 

neutrons or charged particles and the like. In addition, this embodiment encompasses the use 
of more than one of the vectors of the present invention in a cocktail in combination with 
radiation therapy. 

Another embodiment of the invention involves the use of the recombinant vector in 

1 5 combination with chemotherapy as has been disclosed for other adenovirus vectors (U.S. 
Patent No. 5,846,945). Chemotheraputic agents are known in the art and include 
antimetabolites including pyrimidine-analogue and purine-analogue antimetabolites, plant 
alkaloids, antitumor antibiotics, alkylating agents and the like. The use of more than one of 
the vectors of the present invention with a chemotheraputic agent or agents is also 

20 contemplated within this embodiment. 

Among the several advantages found to be achieved by the present invention, 
therefore, may be noted the provision of replication-competent vectors, particularly viruses, 
which rapidly kill cancer cells and spread from cell-to-cell in a tumor; the provision of such 
vectors whose replication can be induced or which is restricted to tumors and/or to cells of a 

25 certain tissue type; and the provision of compositions and methods for anti-cancer therapy 
which cause little to no side effects in normal tissues. 
Brief Description of the Drawings 

Figure 1 is a schematic of gene expression in Ad5 (Fig. 1A) and KD3, a preferred 
embodiment of the invention (Fig. IB), in which the respective genomes are represented by 

30 the stippled bars and transcription units represented by arrows above and below the bars, with 
the E3 proteins listed above the arrows for the E3 transcription unit, and the LI to L5 families 
of late mRNA's indicated. 

Figure 2 illustrates the overexpression of ADP by KD1, KD3 , GZ1, and GZ3 
showing an immunbblot of proteins isolated from human A549 cells infected with the 

35 indicated viruses and probed with an anti-ADP antibody, with ADP indicating differently 
glycosylated andproteolytically processed forms of ADP. 
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Figure 3 illustrates that the El A J/1 101/1 107 mutation referred to in the figure and 
hereinafter as J/01/07, retards expression of late proteins, showing an immunoblot of E1A 
proteins and late proteins in A549 cells infected with the indicated viruses in the absence 
(Figs. 3A and 3B) or presence (Figs. 3C and 3D) of J/327, which has a wild-type E1A region 
5 and has a deletion of all E3 genes but the gene encoding the 12.5K protein (Figs. 3C and 3D). 
An antiserum specific to the E1A proteins was used for Fig. 3A and 3C. An antiserum raised 
against Ad5 virions was used for Figs. 3B and 3D. 

Figure 4 illustrates that KD1 and KD3 kill cells more efficiently than control viruses 
that express less or no ADP, showing a graph of the percent of A549 cells infected with the 
1 0 indicated viruses that were viable at the indicated days p.i. as determined by trypan blue 
exclusion. 

Figure 5 is a cell spread assay illustrating that overexpression of ADP enhances 
spread of virus from cell to cell, showing monolayers infected with the indicated viruses at the 
indicated PFU/cell which were treated at 7 days p.i. with crystal violet, which stains live cells 

15 but not dead cells. 

Figure 6 illustrates that KD1 and KD3 replicate well in growing cells but not in 
growth-arrested cells showing the virus titer extracted from growing or growth arrested HEL- 
229 cells at various times following infection with 100 PFU/ml of the following viruses: 
J/309 (Fig. 6A), J/01/07 (fig. 6B), KD1 (Fig. 6C) and KD3 (Fig 6D). 

20 Figure 7 illustrates that KD1 and KD3 are defective in killing primary human 

bronchial epithelial cells showing these cell monolayers infected at 30% confluency with 10 
PFU/ml of the indicated viruses and stained at 5 days p.i. with neutral red. 

Figure 8 illustrates that KD1 and KD3 reduce the growth rate of human A549 cell 
tumors growing in nude mice, showing in Fig. 8A a graph of average-fold increase in tumor 

25 size plotted against the number of weeks following infection of the tumor with buffer or with 
5 x 10 7 PFU at weekly intervals of or the indicated viruses, and showing in Fig. 8B a similar 
graph of tumors injected once with 5 x 10 s PFU of KD3 or GZ3. 

Figure 9 illustrates that KD1 and KD3 reduce the growth rate of human Hep3B cell 
tumors growing in nude mice, showing a graph of average-fold increase in tumor size plotted 

30 against the number of weeks following injection of the tumor with buffer or with 5 x 10 7 PFU 
of J/309, KD1 or KD3 at twice weekly intervals of the indicated viruses. 

Figure 10 illustrates that KD1 and KD3 complement the replication and spread of Ad- 
(3-gal, a replication-defective vector that expresses (3-galactosidase, using an infectious center 
assay showing in Fig. 10A a picture of A549 cell monolayers seeded with A549 cells infected 

35 with Ad-p-gal alone or with the indicated viruses, with Figs 10B and 10C showing close-up 
views of two of the monolayers of Fig. 10A. 
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Figure 1 1 is a bar graph illustrating that KD1 and KD3 increase the expression of 
luciferase in human Hep3B cell tumors growing in nude mice, using an assay in which tumors 
were injected with the indicated combinations of viruses, then were extracted 2 weeks p.i. and 
assayed for luciferase activity. The numbers in parentheses indicated the fold increase in 
luciferase activity compared to that of the Adluc vector plus buffer. 

Figure 12 is a graph showing the results of a standard plaque development assay for 
KD1 and KD1-SPB on A549 cells engineered to express the TTF1 transcription factor 
(A549/TTF1) and the parental 549 cells, in which data are plotted as the number of plaques 
observed on a particular day in the assay divided by the final number of plaques observed for 
that virus multiplied by 100. 

Figure 13 is a cell spread assay for KD1 and KD1-SPB on H441 cells and Hep3B 
cells, where cells were infected with the indicated amounts of KD1 or KD1-SPB and H441 
cells and Hep3B cells were strained with crystal violet at 5 days p.i. and 8 days p.i., 
respectively. 

Figure 14 is a graph showing the results of a standard plaque development assay for 
<//309 and two preferred embodiments of the invention, GZ1 and GZ3, in which data are 
plotted as the number of plaques observed on a particular day in the assay divided by the final 
number of plaques observed for that virus multiplied by 100. 

Figure 15 is a cell spread assay illustrating that the combination of KD1, KD3, GZ1, 
or GZ3 with x-ray radiation is more effective in destroying A549 cell monolayers than is 
virus vector alone or radiation alone, wherein cells were infected with the indicated amounts 
of the indicated viruses, radiated with 600 centigreys (cGy) of x-radiation (bottom panel), or 
mock radiated (top panel), then stained with crystal violet at 6 days p.i. 

Figure 16 is a graph of a cell spread assay illustrating that 10" 3 PFU of KD1, KD3, 
GZ1, or GZ3 used in combination with 150, 300, or 600 centigreys of radiation is more 
effective in destroying A549 cell monolayers than virus vector alone or radiation alone. Cell 
viability is based on the amount of crystal violet extracted from the culture wells, using the 
mock-infected non-radiated well as 100% viability. 

Figure 17 illustrates that the combination of KD3 or GZ3 plus x-ray radiation is more 
effective in reducing the growth of A549 cell tumors growing in nude mice than KD3 alone or 
GZ3 alone. 

Figure 18 illustrates a structure-function analysis of ADP, showing in Fig. 18A the 
amino acid sequence of the adenovirus death protein encoded by Ad2, with the various 
putative domains and glycosylation sites labeled and showing in Fig. 18B a schematic of the 
ADP gene in rec700 and in the indicated deletion mutants, with the right column 
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summarizing the death promoting phenotype of the various mutants as a percentage of the 
wild-type phenotype. 

Figures 19A and 19B illustrate a cell viability assay of the indicated ADP mutants 
showing a graph of viability as determined by trypan blue exclusion plotted against hours 
5 (Fig. 1 9A) or days (Fig. 1 9B) postinfection. 

Figure 20 depicts the amino acid sequence, shown in single letter code, for the ADP 
proteins of Adl , Ad2, Ad5, and Ad6 (SEQ ID NOS:5-8), for the Ad2 ADP mutants dl7l 6, 
J/715, d/714, and rf/737 (SEQ ID NOS:9-12), and for the putative lumenal domain (SEQ ID 
NO: 17), the transmembrane domain (SEQ ID NO: 18), the cytosolic basic-proline domain 
1 0 (SEQ ID NO: 1 9), and the remainder of the cystosolic domain (SEQ ID NO:20) of the ADP 
protein of Ad2. 

Figure 21 presents the complete nucleotide sequence of the genome of Ad5. 

Figure 22 presents the complete nucleotide sequence of the genome of KD1 (SEQ ID 

NO:l). 

1 5 Figure 23 presents the complete nucleotide sequence of the genome of KD3 (SEQ ID 

NO:2). 

Figure 24 is a schematic of the following vectors: A. Ad5. The stippled bar 
indicates the DNA genome of 36 kbp. The open arrow indicates the immediate early El A 
transcription unit, and the black arrows are the delayed early E1B, E2, E3, and E4 

20 transcription units. The hatched arrows indicate the five families of major late mRNAs, and 
also the ADP mRNA, which is synthesized as part of the major late transcription unit. Each 
major late mRNA has a tripartite leader (leaders 1, 2, and 3) spliced to its 5' terminus. B. 
dl309. dl309 is identical to Ad5 except it has the E3-RTD and E3-14.7K genes deleted. dl309 
expresses ADP at levels similar to Ad5. C. KD1. KD1 has two small deletions (indicated by 

25 "X" marks) in the El A gene that abolish binding of the El A proteins to pRB or p300/CBP. It 
lacks all E3 genes except adp. ADP is expressed earlier in infection and in greater abundance 
than is ADP from Ad5 or dl309 Doronin et al., J. Virol. 74:6147-6155. D. KD1-SPB. KD1- 
SPB is identical to KD1, except it has the E4 promoter replaced by the promoter for 
Surfactant Protein B (SPB-P). 

30 Figure 25 presents graphs illustrating that KD1-SPB grows as well as KD1 in H441 

lung carcinoma cells but much more poorly than KD1 in Hep 3B hepatoma cells. CsCl- 
banded stocks of KD1-SPB and KD1 were titered using standard methods (Tollefson et al, p. 
1-9 In W.S.M. Wold (ed.), Adenovirus Methods and Protocols. Humana Press, Inc., Totowa, 
NJ, 1998) on 293-E4 or 293 cells (A), or on A549 cells (B). The data are plotted as the 

3 5 number of plaques seen on any day of the plaque assay as a percentage of the number of 
plaques seen on the final day of the assay (Tollefson et al. Virology 220:152-162, 1996). 
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Figure 26 presents micrographs illustrating that KD1-SPB induces CPE in H441 cells 
but not Hep 3B cells. H441 and Hep 3B monolayers were mock-infected or infected with 10 
PFU/cell of KD1 or KD1-SPB, then photographed under phase contrast at 4 or 7 days p.i. 
Figure 27 depicts Southern hybridizations and a graph illustrating that KD1-SPB 
5 DNA is synthesized efficiently in H441 but not Hep 3B cells. H441 or Hep 3B cells were 
infected with 10 PFU/cell of KD1 or KD1-SPB. Total genomic DNA was isolated at 0, 5, 24, 
48, 72, and 96 h p.i., digested with Hindlfl, resolved by agarose gel electrophoresis, blotted, 
and hybridized with 32 P-labeled Ad DNA. A. Autoradiogram. B. Phosphorlmager 
quantitation of the DNA bands in Panel A. 
1 0 Figure 28 presents graphs depicting single step growth curves showing that KD1-SPB 

grows well in H441 but not Hep 3B cells. Cells were infected with 10 PFU/cell of KD1 or 
KD1-SPB. Vectors were extracted at the indicated days p.i. and titers determined by plaque 
assay. 

Figure 29 depicts immunoblots showing that KD1-SPB expresses E40RF3 and ADP 
15 in H441 but not Hep 3B cells. Cells were infected with 10 PFU/cell of KD1 or KD1-SPB. At 
24 h p.i., protein extracts were analyzed for El A, E40RF3, and ADP using specific antisera. 
The El A proteins appear as multiple bands. ADP appears as two bands; the upper band is 
glycosylated and the lower band is a proteolytically cleaved species (Scaria et al., Virology 
7P/.-743-753, 1992; Tollefsonetal.,7. Virol. 65:3633-3642). 
20 Figure 30 depicts immunofluorescence micrographs showing that KD1-SPB 

expresses E40RF3 in H441 but not Hep 3B cells. Cells growing on coverslips were infected 
with 20 PFU/cell of KD1, KD1-SPB, or dl309 (wild-type). At 48 h (Panel A) or 6 days 
(Panel B), cells were fixed and stained with a rabbit polyclonal antipeptide antiserum against 
E40RF3. Photographs were taken using a 100X Planapo lens. Each panel shows about 8 
25 nuclei. This figure is part of the same experiment shown in Figure 3 1 . 

Figure 3 1 depicts immunofluorescence micrographs showing that KD1-SPB does not 
express E2-DBP or fiber efficiently in Hep 3B cells. Hep 3B cells were infected with 20 
PFU/cell of KD1-SPB or KD1 . At 48 h (A) or 6 days (B) p.i., cells were fixed and double- 
stained using a rabbit polyclonal antiserum against DBP and a mouse monoclonal antibody 
30 against fiber. The same fields are shown for DBP and fiber. This figure is part of the same 
experiment shown in Figure 30. 

Figure 32 presents graphs illustrating that KD1-SPB lyses H441 but not Hep 3B as 
efficiently as KD1. H441 or Hep 3B cells were mock-infected or infected with 20 PFU/cell 
of KD1 orKDl-SPB. Cell lysis was determined by release of lactate dehydrogenase from the 
35 cells into the medium. 
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Figure 33 presents graphs illustrating that KD1-SPB suppresses growth of H441 
rumors in nude mice equally as well as KD1. Tumor cells were injected into flanks of nude 
mice and allowed to grow to about 100 ul (H441) or 150 ul (Hep 3B) volumes. Tumors (n = 
10) were injected with DMEM (mock) or with 5 x 10 7 PFU of KD1 or KD1-SPB. Injections 
of the viruses were repeated twice weekly for 3 weeks to a total dose of 3.0 x 10 s PFU per 
tumor. Tumors were measured and the mean fold-increase in tumor size was calculated. 
Description of the Preferred Embodiments 

In accordance with the present invention, it has been discovered that overexpression 
of ADP by a recombinant adenovirus results in faster lysis of cells and spread of the virus 
throughout a cell monolayer than viruses expressing wild-type levels of ADP. It has also 
been discovered that this function for ADP is manifest in an adenovirus that contains E1A 
mutations that restrict adenoviral replication to neoplastic cells. Thus, vectors which are both 
replication competent in neoplastic cells and which overexpress ADP should be useful in anti- 
cancer therapy. 

In the context of this disclosure, the following terms will be defined as follows unless 
otherwise indicated: 

"Naturally-occurring" as applied to an object such as a polynucleotide, polypeptide, 
or virus means that the object can be isolated from a source in nature and has not been 
intentionally modified by a human. 

"Neoplastic cell" means a cell which exhibits an aberrant growth phenotype 
characterized by a significant loss of control of cell proliferation and includes actively 
replicating cells as well as cells in a temporary non-replicative resting state (G, or G 2 ). A 
neoplastic cell may have a well-differentiated phenotype or a poorly-differentiated phenotype 
and may comprise a benign neoplasm or a malignant neoplasm. 

"Recombinant virus" means any viral genome or virion that is different than a wild- 
type virus due to a deletion, insertion, or substitution of one or more nucleotides in the wild- 
type viral genome. The recombinant virus can have changes in the number of amino acid 
sequences encoded and expressed or in the amount or activity of proteins expressed by the 
virus. In particular, the term includes recombinant viruses generated by the intervention of a 
human. 

"Replication-competent" as applied to a vector means that the vector is capable of 
replicating in normal and/or neoplastic cells. As applied to a recombinant virus, "replication- 
competent" means that the virus exhibits the following phenotypic characteristics in normal 
and/or neoplastic cells: cell infection; replication of the viral genome; and production and 
release of new virus particles; although one or more of these characteristics need not occur at 
the same rate as they occur in the same cell type infected by a wild-type virus, and may occur 
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at a faster or slower rate. Where the recombinant virus is derived from a virus such as 
adenovirus that lyses the cell as part of its life cycle, it is preferred that at least 5 to 25% of the 
cells in a cell culture monolayer are dead 5 days after infection. Preferably, a replication- 
competent virus infects and lyses at least 25 to 50%, more preferably at least 75%, and most 
» preferably at least 90% of the cells of the monolayer by 5 days post infection (p.i.). 

"Replication-defective" as applied to a recombinant virus means the virus is incapable 
of, or is greatly compromised in, replicating its genome in any cell type in the absence of a 
complementing replication-competent virus. Exceptions to this are cell lines such as 293 cells 
that have been engineered to express adenovirus El A and E1B proteins. 

"Replication-restricted" as applied to a vector of the invention means the vector 
replicates better in a dividing cell, i.e. either a neoplastic cell or a non-neoplastic, dividing 
cell, than in a cell of the same type that is not neoplastic and/or not dividing, which is also 
referenced herein as a normal, non-dividing cell. Preferably, a replication-restricted virus 
kills at least 10% more neoplastic cells than normal, non-dividing cells in cell culture 
monolayers of the same size, as measured by the number of cells showing cytopathic effects 
(CPE) at 5 days p.i. More preferably, between 25% and 50%, and even more preferably, 
between 50% and 75% more neoplastic than normal cells are killed by a replication-restricted 
virus. Most preferably, a replication-restricted adenovirus kills between 75% and 100% more 
neoplastic than normal cells in equal sized monolayers by 5 days p.i. 

In one embodiment the invention provides a vector that is replication-competent in 
neoplastic cells and which overexpresses an ADP. Vectors useful in the invention include but 
are not limited to plasmid-expression vectors, bacterial vectors such as Salmonella species 
that are able to invade and survive in a number of different cell types, vectors derived from 
DNA viruses such as human and non-human adenoviruses, adenovirus associated viruses 
(AAVs), poxviruses, herpesviruses, and vectors derived from RNA viruses such as 
retroviruses and alphaviruses. Preferred vectors include recombinant viruses engineered to 
overexpress an ADP. Recombinant adenoviruses are particularly preferred for use as the 
vector, especially vectors derived from Adl, Ad2, Ad5 or Ad6. 

Vectors according to the invention overexpress ADP. As applied to recombinant Ad 
and AAV vectors, the term "overexpresses ADP" means that more ADP molecules are made 
per viral genome present in a dividing cell infected by the vector than expressed by any 
previously known recombinant adenoviral vector or AAV in a dividing cell of the same type. 
As applied to other, non-adenoviral vectors, "overexpresses ADP" means that the virus 
expresses sufficient ADP to lyse a cell containing the vector. 

Vectors overexpressing ADP can be prepared using routine methodology. See, e.g., 
A Laboratory Cloning Manual, 2nd Ed., vol. 3, Sambrook et al., eds., Cold Spring Harbor 
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Laboratory Press, 1989. For example, a polynucleotide encoding the ADP can be cloned into 
a plasmid expression vector known to efficiently express heterologous proteins in mammalian 
cells. The polynucleotide should also include appropriate termination and polyadenylation 
signals. Enhancer elements may also be added to the plasmid to increase the amount of ADP 
5 expression. Viral vectors overexposing ADP can be prepared using similar materials and 
techniques. 

Where the virus is a recombinant adenovirus, overexpression of ADP can be achieved 
in a multitude of ways. In general, any type of deletion in the E3 region that removes a splice 
site for any of the E3 mRNAs will lead to overexpression of the mRNA for ADP, inasmuch 
1 0 as more of the E3 pre-mRNA molecules will be processed into the mRNA for ADP. This is 
exemplified in the KD1, KD3, GZ1 and GZ3 vectors (SEQ ID NOS: 1-4) whose construction 
is described below. Other means of achieving overexpression of ADP in Ad vectors include, 
but are not limited to: insertion of pre-mRNA splicing and cleavage/polyadenylation signals ' 
at sites flanking the gene for ADP; expression of ADP from another promoter, e.g. the human 
15 cytomegalovirus promoter, inserted into a variety of sites in the Ad genome; and insertion of 
the gene for ADP behind the gene for another Ad mRNA, together with a sequence on the 5' 
side of the ADP sequence that allows for internal initiation of translation of ADP, e.g. the Ad 
tripartite leader or a viral internal ribosome initiation sequence. 

The ADP expressed by a vector according to the invention is any polypeptide 
!0 comprising a naturally-occurring full-length ADP amino acid sequence or variant thereof that 
confers upon a vector expressing the ADP the ability to lyse a cell containing the vector such 
that replicated copies of the vector are released from the infected cell. A preferred full-length 
ADP comprises the ADP amino acid sequence encoded by Adl, Ad2, Ad5 or Ad6. These 
naturally-occurring ADP sequences are set forth in SEQ ID NOS:5-8, respectively. ADP 
5 variants include fragments and deletion mutants of naturally-occurring adenovirus death 
proteins, as well as full-length molecules, fragments and deletion mutants containing 
conservative amino acid substitutions, provided that such variants retain the ability, when 
expressed by a vector inside a cell, to lyse the cell. 

Conservative amino acid substitutions refer to the interchangeability of residues 
D having similar side chains. Conservatively substituted amino acids can be grouped according 
to the chemical properties of their side chains. For example, one grouping of amino acids 
includes those amino acids having neutral and hydrophobic side chains (A, V, L, I, P, W, F, 
and M); another grouping is those amino acids having neutral and polar side chains (G, S, T, 
Y, C, N, and Q); another grouping is those amino acids having basic side chains (K, R, and ' 
H); another grouping is those amino acids having acidic side chains (D and E); another 
grouping is those amino acids having aliphatic side chains (G, A, V, L, and I); another 
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grouping is those amino acids having aliphatic-hydroxyl side chains (S and T); another 
grouping is those amino acids having amine r containing side chains (N, Q, K, R, and H); 
another grouping is those amino acids having aromatic side chains (F, Y, and W); and another 
grouping is those amino acids having sulfur-containing side chains (C and M). Preferred 
! conservative amino acid substitutions groups are: R-K; E-D, Y-F, L-M; V-I, and Q-H. 

As used herein, an ADP variant can also include modifications of a naturally- 
occurring ADP in which one or more amino acids have been inserted, deleted or replaced with 
a different amino acid or a modified or unusual amino acid, as well as modifications such as 
glycosylation or phosphorylation of one or more amino acids so long as the ADP variant 
containing the modified sequence retains cell lysing activity. 

As described below, the inventors herein performed a structure-function analysis of 
ADP that defined specific domains in ADP required to promote cell death. Using this 
information, when combined with known recombinant DNA and cloning methodology, it is 
believed the skilled artisan can readily construct ADP variants of a naturally-occurring 
adenovirus death protein and test them for cell lysing activity. A preferred ADP deletion 
mutant comprises an ADP amino acid sequence from any of the deletion mutants rf/716, 
d/715, rf/714 and <f/737, whose ADP sequences are set forth in SEQ ID NOS:9-12, 
respectively). 

Where the vector is derived from a virus, it is preferred that the virus lack expression 
of one or more viral proteins involved in avoiding host anti-viral defenses such as immune- 
mediated inflammation and/or apoptosis of infected cells. For example, adenovirus contains a 
cassette of genes that prevents killing of Ad-infected cells by the immune system (Wold et bL, 
Semin. Virol., 1998 (8:515-523, 1998). The E3-14.7K protein and the E3 RID (Receptor 
Internalization and Degradation) protein, which is a complex consisting ofRIDa and RTDp, 
inhibit apoptosis of Ad-infected cells induced by tumor necrosis factor (TNF) and the Fas 
ligand which are expressed on, or secreted by, activated macrophages, natural killer (NK) 
cells, and cytotoxic lymphocytes (CTLs) (Tollefson et al., Nature 592:727-730, 1998). The 
E3-gpl9K protein inhibits CTL-killing of infected cells by blocking transport of MHC class I 
antigens to the cell surface (Wold et al., supra). Thus, it is believed that infection of tumor 
cells by such viral vectors will stimulate infiltration of inflammatory cells and lymphocytes 
into the tumor, and will not prevent infected tumor cells from apoptosis induced by cytolytic 
cells of the immune system, or against apoptosis inducing cytokines. For example, it is 
known that when mice are infected with Ad mutants lacking the E3 gpl9K, RID and 14.7K 
proteins there is a dramatic increase (as compared to E3-positive Ad) in infiltration of 
inflammatory cells and lymphocytes into the infected tissue (Sparer et al., J. Virol. 70:2431- 
2439, 1996). A similar infiltration of tumors infected by an ADP-expressing viral vector of 
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the invention would be expected to further promote destruction of the tumor by adding an 
immune system attack to the ADP-mediated killing activity. For example, it is believed that 
the viral infection will stimulate formation of tumor-specific CTL's that can kill neoplastic 
cells not only in the tumor but also ones that have metastasized. In addition, it is also 
5 expected that vector-specific CTL's will be generated which could attack vector-infected cells 
if the vector spreads away from the tumor into normal cells. Because viral vectors 
overexpressing ADP will spread rapidly through the tumor, it is believed these immune 
mechanisms will have little effect on spread of the vector. 

Where the vector is a recombinant adenovirus, it is preferred that the adenovirus lack 
) expression of each of the E3 gpl9K, RID, and 14.7K proteins. By "lack expression" and 
"lacking expression" of a protein(s), it is meant that the viral genome contains one or more 
mutations that inactivates expression of a functional protein, i.e., one having all the functions 
of the wild-type protein. The inactivating mutation includes but is not limited to substitution 
or deletion of one or more nucleotides in the encoding gene(s) that prevents expression of 
functional transcripts or that results in transcripts encoding nonfunctional translation products. 
A particularly preferred way to inactivate expression of the Ad E3 gpl9K, RID, and 14.7K 
proteins is by deleting the E3 region containing the genes encoding these proteins. 
Preferably, one or both of the E3 genes encoding the E3 6.7K and 12.5K proteins are also 
deleted because, as discussed in the Examples below, it is believed that deletion of most or all 
of the E3 genes other than the ADP gene facilitates overexpression of ADP mRNA by 
reducing competition for splicing of the major late pre-mRNAs. Preferred Ad vectors 
containing an E3 deletion that overexpress ADP are GZ1 (SEQ ID NO:3) and GZ3 (SEQ ID 
NO:4), whose construction and properties are described in the Examples below. 

The invention also provides ADP-expressing vectors whose replication is restricted to 
dividing cells. Any means known to provide such a replication-restricted phenotype may be 
used. For example, WO 96/40238 describes microbes that preferentially invade tumor cells 
as well as methods for identifying and isolating bacterial promoters that are selectively 
activated in tumors. It is also contemplated that expression of one or more vector proteins 
essential for replication can be placed under the control of the promoter for a cellular gene 
whose expression is known to be unregulated in neoplastic cells. Examples of such genes 
include but are not limited to: the breast cancer markers mammaglobin (Watson et al. 
Oncogene 7*817-824, 1998); BRCA1 (Norris et al.,./ Biol. Chenu 270:22777. HUl, 1995) 
her2/neu (Scott et al.,./. Biol. Chem. 269: 19848-19858, 1994); prostate specific antigen (U.S. 
Patent 5,698,443); surfactant protein B for lung alveoli (Yan et al., J. Biol. Chem. 270:24852- 
24857, 1995); factor VH for liver (Greenberg et al, Proc. Nail. Acad. Sci. USA 92:12347- 
12351, 1995); and survivin for cancer in general (Li et al., Waft/re J96- : 580-584). Wherethe 
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vector is an adenovirus, it is contemplated that such tumor-specific promoters can be 
substituted for the E4 promoter. Because E4 gene products are essential for Ad replication, 
placing their expression under the control of a tumor-specific promoter should restrict 
replication of the vector to tumor cells in which the promoter is activated. 
5 Another strategy for restricting replication of ADP-expressing Ad vectors to 

neoplastic cells is exemplified by the KD1 (SEQ ID NO: 1), KD2 (SEQ ID NO: 1 3) and KD3 
(SEQ ID NO:2) vectors, whose construction and properties are described in the Examples 
below. This strategy exploits a pre-existing Ad5 mutant in the El A gene, named dll 101/1 107 
(Howe et al., Proc. Natl. Acad. ScL, 57:5883-5887, 1990), also referred to herein as <//01/07, 

10 and which can only grow well in cancer cells. The role of El A is to drive cells from the G 0 
and Gi phases of the cell cycle into S-phase. This is achieved by two mechanisms, one 
involving pRB (and family members), and the other involving p300 and the related protein 
CBP (DePinho, RA., Nature 597:533-536, 1998). One domain in E1A binds members of the 
pRB family. pRB normally exists in the cell as a complex with the transcription factor E2F-1 

1 5 and E2F family members (E2F), tethered via E2F to E2F binding sites in promoters of cells 
expressed in S-phase. Here, pRB acts as a transcriptional co-repressor. El A binding to pRB 
relieves this repression, and causes the release of E2F from pRB/E2F complexes. Free E2F 
then activates promoters of genes expressed in S-phase, e.g. thymidine kinase, ribonucleotide 
reductase, etc. Another domain in El A binds the p300/CBP transcription adaptor protein 

20 complex. p300/CBP is a transcriptional co-activator that binds many different transcription 
factors and accordingly is targeted to promoters. p300/CBP has intrinsic histone 
acetyltransferase activity. El A binding to p300/CBP is believed to inhibit this histone 
acetyltransferase activity, allowing acetylation of histones and repression of transcription 
(Chakravarti et al., Cell 96:393^*03, 1999; Hamamori et al., Cell 95:405-413, 1999). 

25 Conceivably, some of the genes that are repressed as a result of El A interacting with 

p300/CBP to play a role in blocking the cell cycle, although this is not known. Cancer cells 
are cycling, so they have free E2F and presumably some p300/CBP-regulated genes are 
repressed. Consistent with these ideas, El A must bind both p300/CBP and the pRB family in 
order to transform primary cells to a constitutively cycling state (Howe et al., supra). The 

30 mutant dlO 1/07 lacks both the p300/CBP- and pRB-binding domains and, as expected, it 

replicates very poorly in non-dividing "normar* cells or serum-starved cancer cells, but well 
in growing cancer cells. As described below, the growth of the KD1 and KD3 vectors, which 
contain the rf/01/07 El A mutation, is very much better in dividing cancer cells as compared to 
non-dividing cells. Because the <//01/07 mutant is completely defective in oncogenic 

35 transformation of rat cells (Howe et la., supra), vectors according to the invention that contain 
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this El A mutation cannot induce cancer in humans (remote as that may be) through an E1A- 
dependent mechanism. 

The invention also includes vectors overexpressing ADP whose replication is 
restricted to specific tissues by placing expression of one or more proteins essential for 
replication under the control of a tissue specific promoter and/or a tumor specific promoter. 
A number of tissue-specific and/or tumor specific promoters have been described in the art. 
Non-limiting examples include the surfactant protein B promoter, which is only active in cells 
containing the TTF1 transcription factor (i.e., type II alveolar cells (Yan et al., supra)), as 
described in U.S. Patent 5,466,596 to Breitman et al., which directs gene expression 
specifically in cells of endothelial lineage; prostate specific antigen which is expressed in 
prostate cells (Rodriguez et al., supra); human telomerase protein (hTERT) promoter (see, 
e.g., U.S. Patent No. 6,054,575); and human alpha-lactalbumin gene which is expressed in' 
breast cancer cells (Anderson et al., Gene Therapy (5:854-864, 1999). Many other tissue- 
specific, tumor specific, or tissue-preferred enhancer/promoters have been reported (Miller 
and Whelan, Human Gene Therapy 5:803-815, 1997). As exemplified with the surfactant 
protein B promoter in Examples 6 and 10, vectors expressing tissue-specific promoters would 
be expected to show tissue specificity in viral replication, viral spreading, cell lysis, and 
tumor suppression. 

Replication of vectors according to the invention can also be controlled by placing 
one or more genes essential for vector replication under the control of a promoter that is 
activated by an exogenous inducing agent, such as metals, hormones, antibiotics, and 
temperature changes. Examples of such inducible promoters include but are not limited to 
metallothionein promoters, the glucocorticoid promoter, the tetracycline response promoter, 
and heat shock protein (hsp) promoters such as the hsp 65 and 70 promoters. 

The invention also provides compositions comprising a recombinant vector that 
overexpresses ADP in an amount effective for promoting death of neoplastic cells and a 
method comprising administering a therapeutically effective amount of the vector to a 
neoplastic cell in a patient. It is believed the compositions and methods of the present 
invention are useful for killing neoplastic cells of any origin and include neoplastic cells 
comprising tumors as well as metastatic neoplastic cells. 

It is also contemplated that ADP-expressing viral vectors can be administered to 
neoplastic cells along with a replication-defective virus that expresses an anti-cancer gene 
product. For example, many replication-defective El" Ad vectors for use in cancer therapy 
are well characterized. A limitation of replication-defective vectors is that they only 
synthesize the therapeutic protein in the cell they initially infect, they cannot spread to other 
cells. Also, since the genome does not replicate, transcription can only occur from the input 



Copied from 10206747 on 0M)4~2004 



WO 01/04282 



PCT7US0O/18971 



20 

genomes, and this could be as low as one copy per cell. In contrast, the genome of 
replication-competent Ad vectors are amplified by about 10* in the cell that was initially 
infected, providing more templates for transcription. More amplification is achieved as the 
vector spreads to other cells. By combining replication-defective viral vectors expressing an 
5 anti-cancer gene product with replication-competent viral vectors described herein, it is 
expected that the result will be template amplification and rapid spread of both vectors to 
surrounding cells. For example, with Ad-based vectors, the burst size for each vector should 
be large, ~10 4 PFU/cell, so the probability of co-infection of surrounding cells by both vectors 
will be high. Thus, both the replication-competent and replication-defective vectors should 

1 0 spread simultaneously through the tumor, providing even more effective anti-cancer therapy. 

As an alternative method of delivering an anti-cancer gene product with an ADP 
overexpressing Ad vector, the anti-cancer gene can be engineered into any of the ADP 
overexpressing replication-competent vectors described herein, in order to provide both the 
ADP and the anti-cancer function in a single vector. The anti-cancer gene can be engineered 

1 5 into any appropriate location of the vector, as can be easily determined by the skilled artisan. 
For example, the anti-cancer gene can be engineered into the E3 region. 

Expression of the anti-cancer gene product encoded by the replication-defective 
vector can be under the control of either constitutive, inducible or cell-type specific 
promoters. The anti-cancer gene product can be any substance that promotes death of a 

20 neoplastic cell. The term "gene product" as used herein refers to any biological product or 
products produced as a result of the biochemical reactions that occur under the control of a 
gene. The gene product can be, for example, an RNA molecule, a peptide, a protein, or a 
product produced under the control of an enzyme or other molecule that is the initial product 
of the gene, i.e., a metabolic product. For example, a gene can first control the synthesis of an 

25 RNA molecule which is translated by the action of ribosomes into a prodrug converting 

enzyme which converts a nontoxic prodrug administered to a cancer patient to a cell-killing 
agent; the RNA molecule, enzyme, and the cell-killing agent generated by the enzyme are all 
gene products as the term is used here. Examples of anti-cancer gene products include but are 
not limited to cell-killing agents such as apoptosis-promoting agents and toxins; prodrug 

30 converting enzymes; angiogenesis inhibitors; and immunoregulatory molecules and antigens 
capable of stimulating an immune response, humoral and/or cellular, against the neoplastic 
cell. 

Apoptosis-promoting agents include but are not limited to the pro-apoptotic members 
of the BCL-2 family such as BAX, BAD, BID and BIK, as well as antisense molecules which 
35 block expression of anti-apoptotic members of the BCL-2 family. Examples of 

immunoregulatory molecules are cytokines such as tumor necrosis factor, Fas/Apol/CD95 
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ligand, tumor necrosis factor related apoptosis inducing ligand, interleukins, macrophage 
activating factor and interferon y. Angiogenesis inhibitors include but are not limited to 
endostatin and angiostatin. Toxins include but are not limited to tumor necrosis factor, 
lymphotoxin, the plant toxin ricin, which is not toxic to humans due to the lack of ricin 
receptors in animal cells, and the toxic subunit of bacterial toxins. Examples of pro-drug 
converting enzymes and pro-drug combinations are described in WO 96/40238 and include 
thymidine kinase and acyclovir or gancyclovir; and bacterial cytosine deaminase and 5- 
fluorocytosine. 

The therapeutic or pharmaceutical compositions of the present invention can be 
administered by any suitable route known in the art including for example by direct injection 
into a tumor or by other injection routes such as intravenous, subcutaneous, intramuscular, 
transdermal, intrathecal and intracerebral. Administration can be either rapid as by injection 
or over a period of time as by slow infusion or administration of slow release formulation. 
For treating tissues in the central nervous system, administration can be by injection or 
infusion into the cerebrospinal fluid (CSF). When it is intended that a recombinant vector of 
the invention be administered to cells in the central nervous system, administration can be 
with one or more agents capable of promoting penetration of the vector across the blood-brain 
barrier. Preferably, vectors of the invention are administered with a carrier such as liposomes 
or polymers containing a targeting moiety to limit delivery of the vector to targeted cells. 
Examples of targeting moieties include but are not limited to antibodies, ligands or receptors 
to specific cell surface molecules. 

Compositions according to the invention can be employed in the form of 
pharmaceutical preparations. Such preparations are made in a manner well known in the 
pharmaceutical art. One preferred preparation utilizes a vehicle of physiological saline 
solution, but it is contemplated that other pharmaceutically acceptable carriers such as 
physiological concentrations of other non-toxic salts, five percent aqueous glucose solution, 
sterile water or the like may also be used. It may also be desirable that a suitable buffer be 
present in the composition. Such solutions can, if desired, be lyophilized and stored in a 
sterile ampoule ready for reconstitution by the addition of sterile water for ready injection. 
The primary solvent can be aqueous or alternatively non-aqueous. 

The carrier can also contain other pharmaceutically-acceptable excipients for 
modifying or maintaining the pH, osmolality, viscosity, clarity, color, sterility, stability, rate 
of dissolution, or odor of the formulation. Similarly, the carrier may contain still other 
pharmaceutically-acceptable excipients for modifying or maintaining release or absorption or 
penetration across the blood-brain barrier. Such excipients are those substances usually and 
customarily employed to formulate dosages for parenteral administration in either unit dosage 
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or multi-dose form or for direct infusion into the cerebrospinal fluid by continuous or periodic 
infusion. 

It is also contemplated that certain formulations containing ADP-expressing vectors 
are to be administered orally. Such formulations are preferably encapsulated and formulated 
5 with suitable carriers in solid dosage forms. Some examples of suitable carriers, excipients, 
and diluents include lactose, dextrose, sucrose, sorbitol, mannitol, starches, gum acacia, 
calcium phosphate, alginates, calcium silicate, microcrystalline cellulose, 
polyvinylpyrrolidone, cellulose, gelatin, syrup, methyl cellulose, methyl- and 
propylhydroxybenzoates, talc, magnesium, stearate, water, mineral oil, and the like. The 

1 0 formulations can additionally include lubricating agents, wetting agents, emulsifying and 
suspending agents, preserving agents, sweetening agents or flavoring agents. The 
compositions may be formulated so as to provide rapid, sustained, or delayed release of the 
active ingredients after administration to the patient by employing procedures well known in 
the art. The formulations can also contain substances that diminish proteolytic degradation 

1 5 and promote absorption such as, for example, surface active agents. 

The specific dose is calculated according to the approximate body weight or body 
surface area of the patient or the volume of body space to be occupied. The dose will also be 
calculated dependent upon the particular route of administration selected. Further refinement 
of the calculations necessary to determine the appropriate dosage for treatment is routinely 

20 made by those of ordinary skill in the art. Such calculations can be made without undue 

experimentation by one skilled in the art. Exact dosages are determined in conjunction with 
standard dose-response studies. It will be understood that the amount of the composition 
actually administered will be determined by a practitioner, in the light of the relevant 
circumstances including the condition or conditions to be treated, the choice of composition to 

25 be administered, the age, weight, and response of the individual patient, the severity of the 
patient's symptoms, and the chosen route of administration. Dose administration can be 
repeated depending upon the pharmacokinetic parameters of the dosage formulation and the 
route of administration used. 

The invention also contemplates passively immunizing patients who have been 

30 treated with a viral vector overexpressing ADP. Passive immunization can include 

administering to the patient antiserum raised against the viral vector, or gamma-globulin or 
vector-specific purified polyclonal or monoclonal antibodies isolated from the antiserum. 
Preferably, the patient is passively immunized after a time period sufficient for the viral 
vector to replicate in and spread through the tumor. 

3 5 Preferred embodiments of the invention are described in the following examples. 

Other embodiments within the scope of the claims herein will be apparent to one skilled in the 
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art from consideration of the specification or practice of the invention as disclosed herein. It 
is intended that the specification, together with the examples, be considered exemplary only, 
with the scope and spirit of the invention being indicated by the claims which follow the 
examples. 

Example 1 

This example illustrates the construction and characterization of the KD1 and KD3 
anti-cancer vectors. 

To construct KD1, the inventors deleted the entire E3 region of a unique plasmid, 
leaving behind only a unique Pad site for cloning. The starting plasmid was pCRII, 
purchased from Invitrogen, containing the Ad5 BamHIA fragment having a deletion of all the 
E3 genes; the E3 deletion is identical to that for KD1 and GZ3, the sequences of which are 
given in SEQ ID NO:l and SEQ ID NO:4, respectively. The ADP gene from Ad5 was cloned 
into the Pad site, then built into the E3 region of the genome of the Ad5 E1A mutant named 
tf/01/07. This was done by co-transfecting into human embryonic kidney 293 cells the 
aforementioned BamHIA fragment containing the ADP gene together with the overlapping 
EcoRIA restriction fragment obtained from <f/01/07. Complete viral genomes are formed 
within the cell by overlap recombination between the Ad sequences in the BamHIA fragment 
in the plasmid and the EcoRIA fragment. KD3 was constructed in the same way except the 
E3 gene for the 12.5K protein was retained in the starting plasmid. A vector named KD2, 
which marginally overexpress ADP, was also prepared. Plaques of each recombinant Ad 
were picked, screened, purified, expanded into CsCl-banded stocks, sequenced, titered, and 
characterized. GZ1 and GZ3 are Ad vectors that are identical to KD1 and KD3, respectively, 
except that GZ1 and GZ3 have wild-type El A sequences as found in AD5 or in the Ad5 
mutant <//309. GZ1 and GZ3 were constructed as described for KD1 and KD3 except that the 
EcoRIA fragment of Ad5 was used for GZ1 and GZ3. 

KD1 and KD3 were characterized in cell culture by infecting the human A549 lung 
carcinoma cell line with high titer (1-8 x 10'° plaque forming units [PFU] per ml) virus stocks 
of one of these recombinant vectors, or with one of the control viruses J/01/07, J/309, J/327, 
and Ad5 (wt). Fifty PFU per cell were used for each virus. The descriptions of these viruses 
as well as some other viruses used in these examples are presented in Table 1 . 
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Using a polymerase chain reaction (PCR)-based protocol, an in-frame stop codon was 
introduced into the gene for the E3-gpl9K protein in the E3 region of the Ad5 mutant dl309 
(Jones and Shenk, Cell 77:683-689, 1979). The mutagenesis was conducted using a Sunl- 
Bstl 1071 fragment, nucleotides 28,390 to 29,012 in the Ad5 genome, which was then 
substituted for the equivalent fragment in <//309. d/01/07 is the parent for KD1 and KD3. In 
turn, the Ad5 mutant named dl309 is the parent of rf/01/07, i.e. dl309 is identical to rf/01/07 
except that rf/309 does not have the El A mutation. Both <//01/07 and rf/309 have deletions of 
the genes for the E3 RJDa, RIDp and 14.7K proteins but retain the gene for ADP. The Ad5 
mutant rf/327 has wild-type El A, it lacks the gene for ADP, and its lacks all other E3 genes 
except the one for the 12.5K protein. 

At 24 and 36 hours post-infection (h p.i.), proteins were extracted from the A549 cells 
and analyzed for ADP by immunoblot using a rabbit antiserum against ADP (Tollefson et al., 
J. Virol. 66:3633-3642, 1992). The results are shown in Figure 2. Much more ADP was 
detected at 24 and 36 h p.i. in KD1- and KD3-infected cells than in cells infected with 
rf/01/07. Also, much more ADP was synthesized by GZ1 and GZ3 than <//309 or the other 
viruses. Most importantly, KD1, KD3, GZ1, and GZ3 expressed much more ADP at 24 h p.i. 
than did <f/01/07 or rf/309 (Fig. 2). This result is consistent with an observation discussed 
below that the cells infected with KD1, KD3, GZ1, or GZ3 lyse faster, and that these viruses 
spread from cell to cell faster than rf/01/07 or rf/309. It is noteworthy that KD1, KD3, GZ1, 
and GZ3 express much more ADP at 24 and 36 h p.i. than the Ad5 mutant if/1520 (Fig. 2); 
rf/1520 is the original name given to ONYX-015 (Heise et al., Nature Medicine 3:639-645, 
1997). As expected, no ADP was detected in cells infected with pm734.l (Fig. 2), a mutant 
that lacks amino acids 1 to 48 in ADP (Tollefson et al., J. Virol. 70:2296-2306, 1996). 
Expression of the ElAproteins by rf/01/07, KD1, KD2, and KD3 was slightly less than by 
Ad5, rf/309, or dim, and as expected from the d/01/07 deletion, the proteins were smaller 
(Fig. 3A). dliTI is isogenic withd/324 (Thimmappaya et al., 1982 Ce// 37:543-51, 1983), 
and it lacks the gene for ADP and all other E3 proteins except the 12.5K protein. 

The amount of ADP detected in the KD1 and KD3 infected cells is significantly 
higher than the amount detected in the <//309 infected cells (Fig. 2). If one takes into 
consideration the fact that the viruses with the El A mutation replicate somewhat slower, as 
evidenced in by the delayed appearance of the late proteins (Fig. 3B), it is clear that KD1 and 
KD3 express much more ADP per viral genome present in the cell than d/309. This finding is 
supported by the fact that when A549 cells are reinfected with a virus containing the E1A 
mutation and dim, which lacks ADP but has wild-type E1A, the replication rates of the El A 
mutant viruses speed up, as indicated by earlier appearance of late proteins (compare Figs. 3B 
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and 3D). Thus, J/327 complements the El A mutation. In conclusion, these experiments 
demonstrate that ADP is dramatically overexpressed by KD1 , KD3, GZ1, and GZ3. ADP is 
marginally overexpressed by KD2 (not shown). 

Example 2 

This example illustrates that KD1 and KD3 lyse cells more rapidly and spread from 
cell-to cell faster than other adenoviruses. 

The ability of KD1 and KD3 to lyse cells was examined by a trypan blue exclusion 
cell viability assay which was performed essentially as described by Tollefson et al., J. Virol. 
70:2296-2306, 1996. In brief, A549 cells were mock-infected or infected with 20 PFU/cell of 
KD1, KD3, J/01/07, J/327 or J/309. At various days p.i., the number of viable cells was 
determined using a hemocytometer (600 to 1000 cells were counted per time point) and the 
results are shown in Fig. 4. 

Only 25% of the KD1 -infected cells and 9% of the KD3-infected cells were alive at 5 
days p.i. as compared to 44% of cells infected with J/01/07, which has the same E1A 
mutation as KD1 and KD3. The KD1 and KD3 vectors also lysed cells faster than J/309, 
which has a wild-type El A region. When infected with J/327 (ADP", El A*), 94% of the cells 
were alive after 5 days. When cell lysis was estimated by release of lactate dehydrogenase, 
KD1 and KD3 once again lysed cells faster than J/01/07 and J/309, and J/327 caused little 
cell lysis (data not shown). Thus, ADP is required for efficient cell lysis, and over-expression 
of ADP increases the rate of cell lysis. 

As another means to measure cell lysis and to examine virus replication in cancer 
cells, separate groups of A549 cells were infected with 20 PFU/cell of KD1, KD3, J/01/07, or 
J/309 and the amount of intracellular and extracellular virus was determined by plaque assay 
on A549 cells. At 2 days p.i., the total amount of virus formed in each group was similar, 2-4 
x 10 8 PFU/ml, indicating that replication of all the viruses is similar. However, when the ratio 
of extracellular to intracellular virus was calculated, the value for KD1 and KD3 was 2-3 logs 
higher than for Ad5, J/309, or J/0 1/07 (data not shown). Thus, virus is released much more 
rapidly from cells infected with KD1 and KD3, which overexpress ADP, than with viruses 
expressing wild-type amounts of ADP. 

The ability of KD1 and KD3 to spread from cell-to-cell was measured in a "cell 
spreading" assay. In this assay monolayers of A549 cells in a 48 well culture dish were mock- 
infected or infected with 10' 3 , 10 2 , 10"', 10°, or 10 PFU/cell of J/327, J/309, Ad5, J/01/07, 
KD1 or KD3. At low PFU/cell, the viruses must go through two or three rounds of 
replication in order to infect every cell in the monolayer. At 1 .0 and 10 PFU/cell, the 
monolayer should be destroyed by the virus that initially infected the cells. To assess the 
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amount of spread in the monolayers by 7 days p.i., crystal violet, which stains live cells but 
not dead cells, was added to the monolayers. The results are shown in Fig. 5. 

Remarkably, at 7 days p.i., the monolayer was virtually eliminated by KD1 and KD3 
at 10" 3 PFU/cell, whereas 1.0 PFU/cell was required with J/01/07, J/309 and Ad5. This result 
5 attests to the potency of ADP in mediating cell lysis and virus spread in A549 cells. KD1 and 
KD3 are also more effective that J/01/07 in killing other types of human cancer cell lines 
(most purchased from the American Type Culture Collection [ATCC]) as determined in this 
cell spreading assay. KD1 and/or KD3 killed HeLa (cervical carcinoma), DU145 (prostate), 
and pC3 (prostate) cells at 10" 2 PFU/cell, ME- 180 (cervix) and Hep3B (liver) at 10"' PFU/cell, 

10 and Ul 18 (glioblastoma) and U373 (glioblastoma) at 10 PFU/cell. From 10- to 100-fold 
more J/01/07 was required to kill these cells (data not shown). These results indicate that 
KD1 and KD3 may be effective against many types of cancer. 

An important aspect of the finding that ADP overexpressing vectors lyse cells at very 
low multiplicities of infection is that the multiplicity of infection in human tumors is likely to 

1 5 be low at sites distal to the sight of vector injection or distal to blood vessels that carry the 
vector to the tumor. Thus, ADP overexpressing vectors have an advantage over vectors that 
express less ADP or no ADP at all. 

Example 3 

This example illustrates that KD1 and KD3 replicate poorly in non-growing non- 
20 cancerous cells. The replication phenotype of KD 1 and KD3 was evaluated using "normal" 
HEL-299 human fibroblast cells, either growing in 10% serum or rendered quiescent using 
0.1% serum. All Ads should replicate well in growing cells, but viruses with the J/01/07 E1A 
mutation should do poorly in quiescent cells because El A is required to drive them out of G 0 . 
J/309, which has wild-type El A, should replicate well in both growing and growth-arrested 
25 cells. 

Cells were infected with 100 PFU/cell of KD1, KD3, dIOl/01, or J/309. At different 
days p.i., virus was extracted and titered. In 10% serum, KD1, KD3, and J/01/07 replicated 
well, reaching titers of 10 6 -10 7 PFU/ml, only slightly less than J/309 (Fig. 6). However, in 
quiescent cells, replication of KD1, KD3, and J/01/07 was 1 .5-2 logs lower than in growing 

30 cells, ranging from 10 4 to 2 x 10 5 PFU/ml. The titer of J/309 reached 10 7 PFU/ml, nearly the 
level achieved in growing cells. At 10 days p.i., quiescent HEL-299 cell monolayers infected 
with 100 PFU/cell of KD1, KD3, or J/01/07 were intact, whereas those infected with J/309 or 
J/327, which have wild-type E1A, showed strong typical Ad cytopathic effect indicative of 
cell death (data not shown). Thus, replication of KD1 and KD3 is severely restricted to 

35 growing cell lines. 
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The restriction associated with the rf/01/07 El A mutation was also tested in primary 
human cells (purchased from Clonetics) growing as monolayers. Bronchial epithelial cells 
(Fig. 7) and small airway epithelial cells were not killed by 10 PFU/cell of KD1, KD3, or 
d/01/07 at 5 days p.i., whereas they were killed by 10 PFU/cell of d/309 or d/327 (data not 
shown). Lung endothelial cells also were not killed after 10 days by KD1, KD3, or d/01/07 at 
10 PFU/cell, but they were killed by 1 PFU/cell of rf/309. These monolayers were 
subconfluent when initially infected, then grew to confluency. The exciting result here is that 
although these primary cells were growing, they did not support replication in this time frame 
and were not killed by KD 1 or KD3. Thus, it is believed these vectors will be restricted to 
cancerous cells, and will have little to no effect on cells such as basal cells that are normally 
dividing in the body. In addition, it is unlikely that KD1 and KD3 will affect dividing 
leukocytes because such cells are poorly infected by Ad. 

In summary, the above experiments demonstrate that KDI and KD3 lyse cancer cells, 
spread from cell-to-cell rapidly, and replicate poorly in quiescent and non-cancerous cells. 
These properties should make them useful in anti-cancer therapy. 

Example 4 

This example illustrates that KDI and KD3 inhibit the growth of human tumors in an 
animal model. 

We could not evaluate mouse or rat tumors in normal mice or rats because they are 
totally non-permissive. Human cancer cell lines growing in nude mice have been used by 
Onyx Pharmaceuticals (Richmond, CA) to evaluate the efficacy of ONYX-015, an Ad vector 
lacking expression of the E1B 55 kDa protein (Heise et al., Nature Med. 5:639-645, 1997). 
We have found that A549 cells, which were used in many of our cell culture studies, form 
excellent rapidly growing solid tumors when injected subcutaneously into nude mice. The 
average tumor reaches ca. 500 ul in four weeks, and is encapsulated, vascularized, and 
attached to the mouse skin (usually) or muscle. 

Nude mice were inoculated into each hind flank with 2 x 10 7 A549 cells. After 1 
week tumors had formed, ranging in size from about 20 ul to 50 ul. Individual tumors were 
injected three days later, and at subsequent weeks for 4 weeks (total of 5 injections), with 50 
ul of buffer or 50 ul of buffer containing 5 x 10 7 PFU of dim, rf/01/07, KDI, KD3, or 
pmllAA, with a total virus dose per tumor of 3 x 10 8 PFU. The mutant pm734. 1 lacks ADP 
activity due to two nonsense mutations in the gene for ADP, but all other Ad proteins are 
expected to be synthesized at wild-type levels (Tollefson et al., /. Virol 70.2296-22Q6, 1996). 
The efficacy of each virus (or buffer) was tested on six tumors. At weekly intervals, the 
length (L) and width (W) of tumors were measured using a Mitutoyo digital caliper. Tumor 
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volumes were calculated by multiplying L x W x W/2. This value was divided by the tumor 
volume at the time of the initial virus injection, the fold-increase in tumor growth was 
calculated, and the average for the six tumors was graphed. 

As shown in Fig. 8A, tumors that received buffer continued to grow, increasing about 
14-fold by 5 weeks. In contrast, tumors injected with which expresses normal amounts 
of ADP and lacks the E3 RID and 14.7K and proteins, only grew about 2.5-fold by 5 weeks. 
With pmllA. 1 , which lacks ADP, the tumors grew as well as those that received buffer. 
Thus, <//309 markedly decreases the rate of tumor growth, and ADP is required for this 
decrease. Tumors inoculated with d/01/07 grew about 8-fold over 5 weeks. Since rf/01/07 is 
identical to J/309 except for the El A mutation, this result indicates that the El A mutation 
significantly reduces the ability of Ad to prevent growth of the tumors. This effect is 
probably due to a reduction in virus replication in the tumors resulting in lower ADP 
expression, but it could also reflect other properties of El A in the tumor cells, e.g. the 
inability of the mutant El A proteins to induce apoptosis. Most importantly, tumors 
inoculated with KD1 or KD3 only grew about 2.5-fold. Thus, the overexpression of ADP by 
KD1 and KD3 allows KD1 and KD3 to reduce tumor growth to a rate markedly slower than 
rf/01/07 (their parental control virus), and even to a rate similar to that of <#309. 

The finding that KD1 and KD3 are as effective as wild-type Ad (i.e. d/309) in 
reducing the rate of A549 tumor growth is highly significant in the context of cancer 
treatment, inasmuch as KD I and KD3 are restricted to cancer cells whereas wild-type Ad 
does not have such a restriction. 

The tumors in Fig. 8 A received five injections of vectors, but only one dose of vector, 
in this case 5 x 10 8 of each of KD3 or GZ3, is sufficient to significantly reduce the rate of 
A549 tumor growth (Fig. 8B). 

We have also found that KD1 and KD3 reduce the rate of growth in nude mice of a 
human liver cancer cell line, Hep3B cells. These cells form rapidly growing tumors that are 
highly vascularized. Nude mice were inoculated into each hind flank with 1 x 10 7 of Hep3B 
cells. After tumors reached about 100 uJ, they were injected twice per week for 3 weeks with 
50uJ of buffer or 5 x 10 7 PFU of KD1, KD3, or rf/309. There were typically 8-10 tumors per 
test virus. The tumor sizes were measured and the fold increase in size at 0 to 3.5 following 
the initial virus injection was graphed as described above for the A549 tumors. Tumors that 
received buffer alone grew 9-fold over 3 weeks and were projected to grow about 12-fold 
over 3.5 weeks (after 3 weeks the mice had to be sacrificed because the tumors were 
becoming too large) (Fig. 9). Tumors that received KD1 or KD3 grew about 4-fold, 
establishing that KD1 and KD3 reduce the growth of Hep3B tumors in nude mice. Tumors 



Copied from 10206747 on {)/-{) A -'?{){) A 



WO 01/04282 



PCT/USOO/18971 



that were injected with J/309 grew 2-fold (Fig. 9). The finding that KD1 and KD3 were 
somewhat less effective than J/309 is probably due to the fact that they do not grow as well as 
J/309 in Hep3B cells, as indicated by a cell spread assay in culture (data not shown). In any 
case, the important points are that KD1 and KD3 are effective against the Hep3B tumors, and 
5 that they contain the E 1 A mutation that limits their replication to cancer cells. 

These results point to the potency of ADP as an anti-tumor agent when expressed in 
an Ad vector. It is highly probable that KD1 and KD3 will provide significant clinical benefit 
when used to infect tumors growing in humans. 

Example 5 

1 0 This example illustrates the use of replication-defective Ad vectors in combination 

withKDl orKD3. 

It is well established that replication-competent (RC) viruses complement replication- 
defective (RD) mutants. That is, when the same cell is infected, the competent virus will 
supply the protein(s) that cannot be made from the mutant genome, and both viruses will 

1 5 grow. To test the ability of KD 1 and KD3 to complement RD viruses, two RD vectors 
expressing P-galactosidase were constructed. The first, named Ad-P-gal, has a cDNA 
encoding P-gal under the control of the Rous Sarcoma Virus promoter substituted for the 
deleted El region. Ad-P-gal also has the E3 region deleted, including the gene for ADP. The 
second, named Ad-P-gal/FasL is identical to Ad-P-gal, except that it also expresses murine 

20 FasL from the human cytomegalovirus promoter/enhancer. These vectors were constructed 
by overlap recombination in human 293 cells that constitutively express the Ad E1A and E1B 
genes and complement replication of the El -minus vectors. 

These RD vectors should infect and express P-gal in A549 cells, but should not 
replicate because the El A proteins are lacking. However, the vectors should replicate when 

25 cells are co-infected with RC Ads. To prove this, A549 cells were infected with 10 PFU/cell 
of Ad-p-gal alone, or with 10 PFU/cell of Ad-p-gal plus 10 PFU/cell of KD1, KD3, J/01/07, 
J/309, or J/327. At 2 days p.i., vims was extracted and Ad-P-gal titers determined by P-gal 
expression in A549 cells. The yields are shown in Table 2 below. 
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Table 2 



Virus 


Yield 
(blue plaques per ml) 


Ad-p-gal 


1 x 10 2 


Ad-P-gal+KDl 


2xl0 5 


Ad-P-gal + KD3 


3xl0 5 


Ad-p-gal + dlO 1/07 


4xl0 4 


Ad-p-gal + J/309 


3xl0 5 


Ad-p-gal + <//327 


3.0 x10 s 



The data in Table 2 indicate that the complementing viruses increased the yield of Ad-P-gal 
by about 10 3 . 

5 A key feature of KD1 and KD3 is that they spread from cell-to-cell faster than other 

Ads. Accordingly, they should complement the spread of Ad-p-gal. To test this, an 
infectious center assay was conducted. A549 cells were infected with Ad-P-gal plus KD1, 
KD3, or rf/01/07. After 2 h, cells were collected, diluted, and seeded onto monolayers of 
fresh A549 cells. After 4 days, the cells were stained with X-gal and the results are shown in 
10 Fig. 10. 

With Ad-P-gal alone, only the originally infected cell (before seeding) should be 
stained, and the vector should not spread to other cells on the seeded monolayer. This was 
indeed the case. In monolayers seeded with A549 cells infected with Ad-p-gal alone (dish 
shown in the top left of Fig. 10A) contained a number of individual blue cells (not visible in 

15 the print); examples are shown in the enlarged view Fig. 10B. However, when the 

monolayers were seeded with A549 cells coinfected with Ad-P-gal and KD1 or KD3, there 
were numerous "comets" of blue cells (Fig. 10A). Each comet represents Ad-P-gal which has 
spread from one initially-infected cell. Most of the cells within a comet were stained with X- 
gal (Fig. 10C). Comets were also observed with d/01/07, but not to the extent of KD1 and 

20 KD3 (Fig. 10A). With <//327 (ADP ), there was little spread from the originally infected cell 
(data not shown). In summary, KD1 and KD3 not only complement the replication of Ad-p- 
gal, they also enhance its rapid spread. 

It is expected that KD1 and KD3 will also complement and enhance the spread of RD 
vectors expressing anti-cancer therapeutic gene products, and this expectation can be readily 
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verified using the Ad-P-gal/FasL in replication and infectious center assays as described 
above. 

KD1 and KD3 not only complement the replication of RD vectors in cell culture, they 
also do so in Hep3B tumors growing in the hind flanks of nude mice. The RD vector used 
5 was AdLuc, an Ad that lacks the E 1 and E3 regions, and has inserted into the E 1 region an 
expression cassette where the firefly luciferase gene is expressed from the Rous Sarcoma 
Virus promoter (Harrod et al., Human Gene Therapy 9:1885-1898, 1998). The Hep3B tumors 
were injected with 1 x 10 7 PFU of AdLuc plus buffer, or 1 x 10 7 PFU of AdLuc plus 5 x 10 7 
PFU of KD1, KD3, dlQMQl, or d/309. After 2 weeks, mice were sacrificed and tumors 
1 0 excised. Proteins were extracted from the tumors and luciferase activity determined using a 
luminometer. The luciferase counts per tumor were 6,800 for AdLuc plus buffer, 1 13,500 for 
KD1, and 146,900 for KD3 (Fig. 1 1). Thus, KD3 and KD1 respectively caused a 22-fold and 
1 7-fold increase in luciferase activity. This increase could be due to elevated synthesis of 
luciferase in cells that were initially coinfected the AdLuc and KD1 or KD3, and it could also 
1 5 be due to spread of AdLuc from cell to cell in the tumor as mediated by KD1 or KD3. 

In summary, infecting a tumor with a replication-competent ADP-overexpressing 
vector according to the invention together with a RD vector expressing an anti-cancer gene 
product should greatly increase the amount of anti-cancer protein synthesized in the rumor 
thereby increasing the ability of the replication-defective vector to promote destruction of the 
20 tumor. 

Example 6 

This example illustrates the construction and characterization of a recombinant Ad 
vector according to the invention which is replication-restricted to cancerous type II alveolar 
cells. 

25 As demonstrated above, the dlO 1 /07 mutation in KD 1 and KD3 limits growth of these 

vectors to cancer cells. To further restrict their replication phenotype, the E4 promoter in 
each virus was deleted and replaced by the surfactant protein B (SPB) promoter to produce 
vectors named KD1-SPB (SEQ ID NO: 14), KD3-SPB (SEQ ID NO: 15), and d/01/07-SPB 
(SEQ ID NO:16). The SPB promoter is only active in cells containing the TTF1 transcription 

30 factor, which has thus far been found primarily in type II alveolar cells of the human lung 
(Lazzaro et al., Development 775:1093-1 104, 1991). Thus, KD1-SPB, KD3-SPB, and 
<7/01/07-SPB should be severely restricted to cancerous type H alveolar cells of the human 
lung. Many lung cancers are of this type. 

The KD1-SPB and KD3-SPB vectors were prepared as follows. The E4 promoter is 

3 5 located at the right end of the Ad genome (Fig. 1 ). Using a pCRH-based plasmid (Invitrogen) 
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containing the Ad5 DNA sequences from the BamHI site (59 map units) to the right hand end 
of the genome, and using and a PCR-based protocol, nearly all the transcription factor binding 
sites were deleted from the E4 promoter Ad5 base pairs 35,623 to 35,775 and replaced with a 
500 base pair fragment containing the SPB promoter (Yan et al., J. Biol. Chem. 270:24852- 
5 24857, 1995). The final plasmids contain the E4-SPB substitution in the E4 region and the 
c//01/07, KD1 , or KD3 versions of the E3 region, respectively, for the viruses <f/01/07-SPB, 
KD1-SPB, and KD3-SPB. These plasmids were co-transfected into 293 cells with a fragment 
containing the left portion of the genome of dlOMQl, and plaques were allowed to develop. 
Plaques were screened for the expected features, purified, then expanded into a stock. 
1 0 The A549-TTF1 cell line was developed in order to test the prediction that replication 

of rf/01/07-SPB, KD1-SPB, and KD3-SPB would be restricted to cancerous cells expressing 
the TTF1 transcription factor. These cells were co-transfected with two plasmids, one in 
which TTF1 is expressed from the CMV promoter, and the other coding for resistance to 
neomycin Resistant clones were isolated and shown to express TTF1 activity as determined 
15 by transient transfection with a plasmid expressing chloramphenicol acetyltransferase from 
the TTF1 -requiring surfactant protein C promoter. 

KD1-SPB and KD1 were subjected to a standard plaque development assay on A549- 
TTF1 cells and parental A549 cells. The results are shown in Fig. 12. With KD1-SPB on 
A549 cells, plaques were not visible after 8 days, only about 4% of the final number of 
20 plaques were seen after 10 days, and about 50% of final plaques were seen after 12 days. 
With KD1-SPB on A549-TTF1 cells, plaques were visible after 6 days, and about 60% of 
plaques were seen after 10 days. Thus, as expected, KD1-SPB grew significantly faster on 
the cells containing TTF1 . KD1 formed plaques more quickly than KD1-SPB on both A549 
and A549-TTF1 cells, indicating that the E4 promoter-SPB substitution is not as effective the 
25 wild-type E4 promoter in inducing Ad replication. However, this difference between KD1- 
SPB and KD1 on A549-TTF1 cells is tolerable, with KD1-SPB delayed only about 1 day. 
Curiously, the final titer obtained for all virus stocks by day 16 was similar, indicating that 
A549 cells may contain a very small amount of endogenous TTF1 activity. It is predicted that 
KD3-SPB and d/01/07-SPB will behave similarly to KD1-SPB when grown in A549-TTF1 
30 cells and A549 cells. 

The restriction of KD1-SPB to cells containing TTF1 was further examined in a cell 
spread assay using H441 cells, a TTF1 -expressing human pulmonary adenocarcinoma cell 
line (Yan et al, supra), and Hep3B cells, a liver cancer cell line not expected to express 
TTF1 . Culture dish wells containing H441 or Hep3B cells were infected with KD1-SPB or 
35 KD1 at multiplicities ranging from 10 to 10" 4 PFU/cell. The H441 and Hep3B cells were 
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stained with crystal violet at 5 days and 8 days p.i., respectively. KD1-SPB and KD1 grew 
and spread equally well on H441 cells, causing destruction of the monolayer at 10"' PFU per 
cell (Fig. 13). (Some of the H441 monolayer has peeled off in the well with KD1-SPB at 10' 2 
PFU per cell, and in the wells with KD1 and KD1-SPB at 10" 4 PFU per cell; this occasionally 
i occurs in cell spread assays, and it does not reflect virus infection). With Hep3B cells, KD 1 
grew and spread very much better than KD1-SPB, with 10" 2 PFU per cell of KD1 causing 
more destruction of the monolayer as 1.0 PFU per cell of KD1-SPB (Fig. 13). 

In summary, this example demonstrates that a replication-competent Ad, which 
replicates well on cells expressing the appropriate transcription factor, can be constructed 
with a tissue-specific promoter substituted in place of the E4 promoter. This methodology 
should be applicable to many other tissue specific and cell type specific promoters. One 
possibility would be a liver-specific promoter. Another possibility would be to use the E2F 
promoter, or another promoter with E2F sites, inasmuch as that promoter would be active 
only in cells such as cancer cells that have free E2F. A third possibility would be to use a 
regulatable promoter, e.g. the synthetic tetracycline response promoter (Massie et al., J. Virol 
72. 2289-2296, 1998), where the activity of the promoter is controlled by the level of 
tetracycline or a tetracyclin analog in the patient. 

Example 7 

This example illustrates the construction and characterization of vectors which 
overexpress ADP and are not replication restricted. 

As demonstrated above, the J/01/07 El A mutation in KD1 and KD3 is attenuating, 
inhibiting growth in non-dividing and even in dividing primary human epithelial and 
endothelial cells. Ads with this mutation are able to replicate well in dividing cancer cells. 
However, replication of such E1A mutants is not as efficient as, e.g. <//309 which has a wild- 
type El A gene. For instance, the rate of replication of rf/01/07, as determined by the rate at 
which plaques develop, is reduced such that d/01/07 plaques appear one day later than those 
of rf/309 (data not shown). This delay is due in part to a delay in expression of Ad late genes 
(see Fig. 3). The idea that the rf/01/07 mutation retards the rate of replication in A549 cells is 
further supported by the data in Fig. 8A, where J/01/07 did not prevent tumor growth nearly 
as well as d/309. Despite this negative effect of the J/0 1/07 El A mutation, there are 
theoretical and practical aspects of having this mutation in the KD1 and KD3 vectors, as has 
been discussed. Nevertheless, one can easily imagine scenarios (e.g. patients with terminal 
cancer) where the ability of an Ad vector to destroy the rumor supercedes the requirement that 
the vector be totally restricted to tumor cells. In such cases, it would be advantageous to have 
vectors similar to KD1 and KD3, but with the wild-type El A gene. The rates at which such 
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vectors express their genes, lyse cells, and spread from cell to cell should be higher than those 
of KD1 and KD3. Such vectors might cause some damage to non-cancerous cells and tissue, 
but this is also true for other modes of anti-cancer treatment such as surgery, chemotherapy, 
and radiation therapy. 

5 In light of these considerations, vectors named GZ1 and GZ3 have been constructed 

that are identical to KD1 and KD3, respectively, except they have a wild-type El A region. 
These vectors were constructed by overlap recombination in A549 cells. The left hand 
fragment contained the wild-type El A region of Ad5, and the right end fragment contained 
the E3 modifications of KD1 or KD3. Plaques were picked, analyzed for the expected 

1 0 genotype, plaque-purified, and expanded into CsCl-banded stocks. The titers of these stocks 
on A549 cells were 2.9 x 10 10 PFU/ml for GZ1 and 1 .6 x 10" PFU/ml for GZ3. Thus, these 
vectors can be grown into high titer stocks comparable to wild-type Ad. The GZ1 and GZ3 
plaques are larger and appear much sooner than the plaques for J/309. Large rapidly- 
appearing plaques reflect the ability of Ad to lyse cells and spread from cell-to-cell (Tollefson 

15 etal.,y. Virol. 70:2296-2306, 1996; Tollefson etal., Virology 220:152-162, 1996), and this 
property, as discussed, is due to the function of ADP. 

The rate of plaque appearance can be quantitated in a plaque development assay 
(Tollefson et al., supra). Here, a typical plaque assay is performed, and the plaques observed 
on subsequent days of the assay are calculated as a percentage of the number of plaques 

20 observed at the end of the plaque assay. As shown in Fig. 14, after 4 days of plaque assay on 
A549 cells, GZ1 and GZ3 had 48% and 34%, respectively, of the final number of plaques, 
whereas J/309 had only 1%. It is very unusual in Ad plaque assays in A549 cells for plaques 
to appear after only 4 days. These large plaques reflect the overexpression of ADP. These 
GZ1 and GZ3 plaques appear sooner than those of KD1 and KD3 (data not shown), no doubt 

25 because GZ1 and GZ3 replicate faster because they have a wild-type El A region. 

GZ1 and GZ3 lyse cells and spread from cell to cell much more effectively than 
J/309. At 6 days p.i. of A549 cells, approximately as much monolayer destruction was 
observed with GZ1 and GZ3 at 10 3 PFU per cell as was observed with J/309 at 10" 1 PFU per 
cell (Fig. 15, top panel). This result further underscores the conclusion that overexpression of 

30 ADP promotes cell lysis and virus spread. 

In theory, GZ1 and GZ3 should be able to replicate not only in tumor cells but also in 
normal cells. Although they can replicate in normal cells, it is quite possible that GZ1 and 
GZ3 may be useful as anti-cancer vectors. First, GZ1 and GZ3 could be injected directly into 
the tumor. Many tumors are self-contained (encapsulated) except for the blood supply. The 
35 physical barriers of the tumor could minimize dissemination of the virus to other tissues. 
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Second, Ads are in general quite benign. Most infections of Ad5 are in infants and result in 
mild or asymptomatic disease, and are held in check by strong humoral and cellular 
immunity. Anti-Ad immunity appears to be life-long. GZ1 and GZ3 could be used only in 
patients who have an intact immune system, and perhaps also with pre-existing anti-Ad 
5 immunity. Further, patients could be passively immunized against Ad, using gamma-globulin 
or even specific purified anti-Ad neutralizing antibodies. Third, considering that Ad5 is a 
respiratory virus which most efficiently infects lung epithelial cells displaying the specific 
Ad5 receptor (named CAR) as well as specific integrins (e.g. a v b5), replication-competent 
vectors derived from Ad5 may not spread efficiently in many non-cancer tissues of the body. 
1 0 In addition, it is believed that versions of GZ1 and GZ3 can be constructed that have the E4 
promoter substituted with a tumor-specific, tissue-specific, cell-specific, or synthetic 
promoter. Such vectors would have the positive features associated with wild-type El A and 
ADP, and yet be replication-restricted to tumor tissue and/or to particular cell types. 

Example 8 

15 This example illustrates that the combination of KD1, KD3, GZ1, or GZ3 with 

radiation is more effective in destroying A549 cells, growing in culture or growing as tumors 
in nude mice, than the vectors alone or radiation alone. 

This was shown in a cell spread assay. A549 cells growing in three 48 well culture 
dishes were mock-infected or infected with different viruses at multiplicities of infection 
20 ranging from 10 to lO^PFU per cell as indicated in Fig. 15. One dish was not radiated. A 
second dish received 600 centrigreys (cGy) of radiation at 24 h p.i., and a third dish received 
2000 cGy of radiation at the same time. All dishes were stained with crystal violet at 6 days 
p.i. With the cells that were not radiated (top panel in Fig. 15), KD1 and KD3 caused 
monolayer destruction at lower multiplicities of infection than their parental control, dlOMOl. 
This was also true for GZ1 and GZ3 as compared to their parental control <//309. (The 
paucity of cells in the cells infected with GZ1 or GZ3 at 10"* PFU per cell is an experimental 
artifact, and is not caused by infection by GZ1 or GZ3). These KD1, KD3, GZ1 and GZ3 
results are consistent with earlier results showing that overexpression of ADP leads to 
increased cell lysis and virus spread. 

With the dish that was infected then radiated with 600 cGy there was markedly 
increased cell killing and virus spread as compared to the non-radiated cells (compare the 
bottom panel of Fig. 15 with the top panel). For example, with KD1, KD3, GZ1, and GZ3 
there was about the same amount of cell destruction in the radiated wells at 10" 4 PFU per cell 
as in the non-radiated wells at 10" 2 PFU per cell. Similar results were seen with the dish that 
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received 2000 cGy of radiation (data not shown), and also with dishes that received 600 or 
2000 cGy of radiation 24 h prior to infection (data not shown). 

The amount of cell destruction was quantitated by extracting the crystal violet from 
the cells with 33% acetic acid, then measuring the absorbance at 490 nm (data not shown). 
The absorbance with non-radiated mock-infected cells was set at 100% cell viability. With 
mock-infected cells that received 600 cGy there was a 15% loss in viability (i.e. 15% less 
crystal violet was extracted). With KD1 at 10° PFU per cell, the non-radiated cells were 80% 
viable whereas the cells receiving 600 cGy of radiation were only about 30% viable. Similar 
differences in viability between radiated and non-radiated cells were seen with KD3, GZ1, 
and GZ3. These results argue that the combination of radiation plus vector has a synergistic 
effect on cell lysis and vector spread, rather than an additive effect. If the effect were only 
additive, then with the KD1 samples at 10" 3 PFU per cell, the cell viability should have been 
65% (15% reduction in viability due to radiation alone, 20% reduction due to KD1 alone). In 
fact, the cell viability was 30% rather than 65%. 

As mentioned, approximately as much cell lysis and virus spread were observed with 
600 cGy as with 2000 cGy. To determine the optimal dose of radiation to synergize with the 
vectors, an experiment similar to the one described above was conducted with mock-, 
<//01/07-, KD1-, KD3-, rf/309, GZ1-, or GZ3-infected A549 cells. The 48 well plates received 
0, 150, 300, or 600 cGy of radiation at 24 h p.i. Cells were stained with crystal violet. The 
results with cells receiving 0 versus 600 cGy of radiation were similar to those in Fig. 15. 
The crystal violet was extracted from the cells infected with 10' 3 PFU per cell of the 
difference viruses. The absorbance of crystal violet was determined, and the percent cell 
viability was graphed, using the absorbance of the non-radiated mock-infected cells as 100% 
cell viability. As illustrated in Fig. 16, an approximately linear decrease in cell viability in all 
wells was obtained with increasing radiation dose, although the slope of the line was more 
negative with KD1, KD3, GZ1, or GZ3 than with mock, rf/01/07, or rf/309. With KD1, KD3, 
GZ1, and GZ3, there was much more cell lysis and vector spread with their parental control 
viruses, and there was synergy between the vectors and radiation. For example, with mock- 
infected cells, 600 cGy reduced cell viability by about 30% (70% of cells were viable). KD1 
without radiation reduced cell viability by about 23%. The combination of 600 cGy radiation 
plus KD1 reduced cell viability to about 85%, more than 53% of which is the sum of radiation 
alone and KD1 alone. When considering the data in Figs. 15 and 16 together, a dose of about 
600 cGy is optimal in this type of cell culture experiment. 

The combination of KD3 or GZ3 with radiation was also examined in the A549 
tumor-nude mouse model (see Example 4). A549 cells were injected into the hind flanks of 
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nude mice, and tumors were allowed to form. When tumors reached approximately 50-ul, 
they were injected with buffer or with 5 x 10 8 PFU of KD3 or GZ3. Eight to ten tumors were 
injected per test condition. At 1 day p.i., half the mice received 600 cGy of whole body 
radiation. Tumor size was measured over time, and was plotted as a fold-increase in tumor 
5 size versus days p.i. (as described in Example 4). As shown in Fig. 1 7, the non-radiated 
buffer-injected tumors grew faster than those injected with KD3 or GZ3. Tumors that 
received the combination of KD3 and radiation did not grow, and those that received the 
combination of GZ3 and radiation shrank in size after 14 days. These results indicate that the 
combination of KD3 plus radiation or GZ3 plus radiation is more effective than either vector 
1 0 alone or radiation alone in reducing the rate of A549 tumor growth in nude mice. It is likely 
that radiation would increase the effectiveness in treating tumors of KD1 and GZ1, or indeed 
any other replication-competent or replication-defective Ad vector. 

The mechanism by which radiation causes the ADP overexpressing vectors to lyse 
cells and spread from cell-to-cell more effectively is not understood. Radiation is expected to 
15 induce eel lular DNA repair mechanisms, and that may allow for more efficient synthesis of 
AdDNA. Radiation may enhance the function of ADP. ADP probably functions by 
interacting with one or more cellular proteins, and radiation may affect this protein(s) such 
that ADP functions more efficiently. 

It is believed that KD1, KD3, GZ1, or GZ3, or any other replication-competent Ad 
20 vector, when used in combination with radiation, will be more effective than vector alone or 
radiation alone in providing clinical benefit to patients with cancer. The vectors should allow 
more tumor destruction with a given amount of radiation. Stated another way, radiation 
should cause more tumor destruction with a given amount of vector. These vectors should 
also allow the radiation oncologist to use less radiation to achieve the same amount of tumor 
25 destruction. Less radiation would reduce the side effects of the radiation. 

It is also believed that a cocktail of vectors when used in combination with radiation 
will be more effective than the cocktail alone or radiation alone. The cocktail could consist of 
ADP producing vectors plus one or more replication defective vectors expressing an 
anticancer therapeutic protein (see Example 5). 
30 Example 9 

This example illustrates a structure-function analysis of adenovirus death protein. 
ADP is an 1 1 .6 kDa N-linked O-linked integral membrane glycoprotein that localizes 
to the inner nuclear membrane (NM) (Scaria et al., Virology 191 :743-753). As illustrated in 
Fig. 18, the Ad2-encoded ADP (SEQ ID NO:6) consists of 101 amino acids; aa 1-40 (SEQ ID 
35 NO: 17) are lumenal, aa 41-59 (SEQ ID NO: 18) constitute the transmembrane signal-anchor 
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(SA) domain, aa 63-70 (SEQ ID NO: 1 9) constitute a basic proline (BP) domain within the 
nucleoplasms (NP) domain, which constitutes aa 61-101 (SEQ ID NO:20). To determine 
which domains in ADP are required to promote cell death, a number of deletion mutants of 
reclOO were prepared which lacked various portions of the ADP gene and examined for the 
5 ability of ADP to localize to the NM and promote death. The reclOO virus is an Ad5-Ad-Ad5 
recombinant, which has been described elsewhere (Wold et al., Virology 7^5:168-180, 1986). 

The structure of ADP in redOO and in each deletion mutant is schematically 
illustrated in Fig. 18. The ADP gene in each deletion mutant has been sequenced using PCR 
methods to insure that the mutations are correct. The structure and activity of ADP in the 

1 0 deletion mutants was tested by infecting A549 cells followed by immunoblot analysis of the 
ADP mutant proteins as well as the ability to lyse cells. All deletion mutants expressed a 
stable ADP protein except /ww734.1 (Al-48, i.e. aa 1-48 are deleted). The pmllA.l (N 14 ) 
ADP, which has Asn M mutated to Ser, is O-glycosylated but not N-glycosylated because 
Asn M is the only N-glycosylation site (data not shown). The <//735 (A4-1 1) ADP is N- 

1 5 glycosylated but not O-glycosylated because the sites for O-glycosylation are deleted (data 
not shown). The pmlZAA (M56) ADP, which has Met 56 in the SA domain mutated to Ser, 
contains exclusively N-linked high-mannose oligosaccharides (data not shown); this occurs 
because the Met 56 mutation precludes exit of ADP from the endoplasmic reticulum (ER). The 
<//738 ADP, which lacks aa 46-60 in the signal-anchor domain, forms insoluble aggregates m 

20 the cytoplasm; therefore, aa 41-59 do in fact include the signal-anchor domain. The pmTSA 
(A 1-40) ADP, which initiates at Metti at the N-terminus of the SA domain, comigrated with 
the lower group of bands generated by proteolytic processing (data not shown). This 
indicates that the proteolytic cleavage sites occur near Met,,. Consistent with this, the 
proteolytic products were not seen with rf/737 (A29-45) (data not shown). Also, the size of 

25 the products decreased in all mutants with deletions within aa 4 1 - 1 0 1 (<//7 1 5 . 1 , dU 1 5, dtl 1 4, 
dVJ\6) (data not shown). 

The ability of these mutants to promote cell death was monitored by trypan blue 
exclusion, plaque development, and lactate dehydrogenase release assays (Tollefson et al., J. 
Virol. 70:2296-2306, 1996). The trypan blue results in Fig. 15A indicate that the death- 

30 promoting function of ADP was abolished by deletion of aa 1 -40 (pmllA), aa 1 1 -26 

(rf/736.1), aa 18-22 (<//735.1), or aa 4-1 1 (d/735). Mutation of the N-glycosylation site at 
Asn M (pm734.7) reduced the death-promoting activity to about 50% of rec700 (WT). dllYI 
(A29-45) was efficient as rec700 in promoting cell death; this indicates that the proteolytic 
processing products must not be required to promote cell death because they are not formed 

35 with dllll. The SA domain is essential for death because d/738 (A46-60) and pm734A 
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(M56) were completely defective (Fig. 19). J/715.1 was nearly completely defective, 
indicating that the BP domain is extremely important. Surprisingly, aa 71-94 (J/714), 76-89 
(J/715), and 79-101 (dl7l6) could be deleted without affecting the death-promoting activity 
of ADP (Fig. 19). On the other hand, deletion of aa 81-88 (J/717) nearly completely 
5 abolished the activity of ADP (Fig. 1 9); this is probably the result of aberrant sorting of ADP 
(see below). Similar results were obtained when the ability of these ADP mutants to promote 
cell death was examined with standard plaque development, LDH-release and MTT assays. 

The effects of these mutations on the intracellular localization of ADP are extremely 
interesting. When examined by immunofluorescence (IF) at 33 h p.i. (data not shown), ADP 
1 0 from retfOO (WT) localized crisply to the NM; localization to the Golgi was also apparent. 
With J/714 (A71-94) and J/715 (A76-89), ADP localized to all membranes, i.e. the ER, Golgi, 
plasma membrane, and NM. This was even more apparent at 45 h p.i. (data not shown) 
Thus, aa 71-94 appear to include a signal that directs ADP specifically to the NM. ADP is 
very likely sorted from the trans-Golgi network (TGN) to the NM, so this putative signal in 
15 ADP probably functions in this sorting pathway. ADP from J/717 (A81-88) is intriguing: it 
localized to the NM and Golgi, but in many cells "dots" and circular structures were observed. 
Again, this was more apparent at 45 h p.i. when these structures were the prominent feature. 
J/717-infected cells have not begun to die at 45 h p.i., so these structures are not cellular 
remnants. The intriguing possibility is that these structures are membrane vesicles that have 
20 pinched off from the TGN but are defective in targeting to and/or fusing with the NM. 

With J/738 (A46-60 in the SA domain), ADP aggregated in the cytoplasm. This 
again indicates that aa 46-60 include the SA sequence. With /wi734.4 (M56), ADP localized 
primarily to the NM. As discussed above, the ^734.4 ADP has exclusively high-mannose 
N-linked oligosaccharides, indicating that it never leaves the ER. Perhaps the putative NM- 
25 localization signal in the C-terminal region of the pm734A ADP targets ADP to the NM by 
lateral diffusion from the ER (which is continuous with the outer and inner NM). 

With J/737 (A29-45), ADP localized to the NM. ADP from ^734 (Al-40),/>/w734.7 
(N14) (N-Iinked glycosylation cannot occur), and J/735 (A4-1 1; the O-glycosylation sites are 
deleted) localized much more prominently to the Golgi than the NM. ADP from J/735. 1 
30 (A18-22) and J/736.1 (Al 1-26) also localized much more strongly to the Golgi than the NM. 
Thus, residues 1-26 and/or glycosylation appear to be required for efficient transport of ADP 
from the Golgi/TGN to the NM. 

In summary, aa 41-59 include the SA domain, Met 56 in the SA domain is required for 
exit from the ER, aa 1-26 are required for efficient exit from the Golgi, and aa 76-94 are 
35 required to target ADP specifically to the NM. With respect to promoting cell death, the 
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essential regions are aa 1 -26, the SA domain (ADP does not enter membranes), Met 56 in the 
SA domain, and the BP domain (aa 63-70). It is not clear whether the defective death- 
promoting phenotype of/w/734 (Al-40), dlTiS (A4-1 1), <ff735.I (A18-22), <#736.1 (Al 1-26), 
and/w«734.7 (N14) is due to lack of sequences (or oligosaccharides) that promote death or to 
5 much slower exit of ADP from the Golgi to the NM. rf/714 (A71-94) and dH\5 (A76-89) 
express a wild-type phenotype for promoting death even though they are defective in 
localizing specifically to the NM; this is probably because sufficient ADP still enters the NM 
to promote death. Even though the deletion in rf/717 (A81-88) lies within the deletions in 
dlllS (A76-89) and rf/714 (A71-94), the dP\7 ADP is only about 15% as efficient as rec700 
10 (WT), dl7\ 5 and dl714 in promoting death. This may be because the <f/717 ADP tends to 

remain in vesicles rather than localizing to the NM. Altogether, these data indicate that ADP 
must localize to the NM in order to promote cell death. 

Example 10 

This example further characterizes the tissue specific Ad vectors described in Example 6. As 
1 5 discussed therein, the Ad E4 promoter is deleted and replaced with the promoter for surfactant 
protein B (SPB) in these vectors (Figure 24). 
Materials and Methods 

Cells, vectors and methods described in Example 6 were also used in this Example. 
In addition to the human cancer cell lines A549 (human lung carcinoma), Hep 3B (human 
20 hepatocellular carcinoma), and H441 (papillary lung adenocarcinoma) used in Example 6, 
HEK 293 cells (obtained from Microbix (Toronto, ON)) and VK10-9 cells were used. VklO- 
9 cells are 293 cells that in addition to El contain and express E4 and pDC. These cells will 
be referred to as 293-E4 cells. 

Experiments employing phase contrast microscopy of Hep 3B and H441 cells were 
25 performed as follows. Monolayers of Hep 3B or H441 cells were grown in 60 mm dishes 

with 5 ml of DMEM (10% FBS), and were mock- infected or infected with KD1 or KD1-SPB 
at a multiplicity of infection of 10 plaque forming units (PFU) per cell. Phase contrast 
photographs of monolayers were taken at 4 and 7 days postinfection (p.i.). 

Experiments employing western blots of H441 or Hep 3B cells were performed as 
30 follows. H441 or Hep 3B cells (in 60 mm dishes) were infected with 10 PFU/cell of KD1 or 
KD1-SPB. At 24 h p.i., the cells were washed three times with PBS and harvested by 
scraping. The cells were lysed by REP A buffer. The protein concentration was measured by 
the BIO-RAD DC Protein Assay Kit (BIO-RAD Laboratories, Hercules, CA) and 10 ug of 
each sample were electrophoresed on 15% sodium dodecylsulfate polyacrylamide gels (SDS- 
35 PAGE). The gels were electroblotted onto PVDF membranes (Immobilon, Millipore, 
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Bedford, MA). The membranes were blocked in TBST (50 mM Tris-Cl, pH 7.6, 150 mM 
NaCl, 0.2% Tween 20) containing 10% dry milk (Carnation) overnight at 4°C. After 
blocking, the membranes were incubated with a rabbit polyclonal antiserum against E40RF3 
(gift of Gary Ketner) or ADP (Tollefson et al., J. Virol. 6(5:3633-3642, 1992), or with M73, a 
5 monoclonal antibody against El A (Harlow et al., J. Virol. 5.5:533-546, 1985). The secondary 
antibodies were goat anti-rabbit IgG-HRP or goat anti-mouse IgG-HRP. The blots were 
developed using the ECL protocol (Amersham Pharmacia, Arlington Heights, IL). 

Experiments employing a lactate dehydrogenase release assay for cell lysis (Tollefson 
et al., J. Virol. 70:2296-2306) were preformed as follows. H441 cells (7.7 x 10 5 cells per 35 
10 mm dish) and Hep 3B cells (9.0 x 10 5 cells per 35 mm dish) were infected at 20 PFU/cell in 
one ml serum-free DMEM. After an adsorption period of 1 h, 3 ml of DMEM (10% FBS) 
were added (final FBS concentration of 7.5%). Cells were incubated at 37°C with 6% C0 2 . 
At daily intervals, supernatants were collected, microfuged to remove floating cells, and cell- 
free supematants were frozen at -70°C until assayed. Total lysis samples were prepared by 
1 5 addition of 1 OX lysis buffer included in the Cyto Tox 96 kit (Promega, Madison, WI). After 
all samples were collected, 20 ul samples were assayed in triplicate using the LDH assay kit 
Cyto Tox 96 and read on an EL340 Microplate reader (BioTecTM Instruments, Inc.) at 490 
nm. 

Experiments employing immunofluorescence evaluation of H441 and Hep 3B cells 
20 were performed as follows. H441 and Hep 3B cells were plated on Coming #1 coverslips in 
35 mm dishes. H441 (1.5 x 10 6 cells/35 mm dish) and Hep 3B (9.0 x 10 s cells/35 mm dish) 
were infected with 20 PFU/cell of the indicated viruses in 1 ml serum-free DMEM. After 1 h, 
1 ml of DMEM/20% FBS was added (final concentration of 10% FBS). At the indicated 
times (48 h or 6 d p.i.), cells were fixed for 10 min in 3.7% paraformaldehyde in PBS, then 
25 permeabilized for 6 min in methanol (-20°C) and rehydrated in PBS. Coverslips were stained 
with rabbit antipeptide antiserum against the Ad E2A-coded DNA binding protein (DBP) 
(1:400 dilution; gift of Maurice Green) and mouse monoclonal antibody against fiber (1:400 
dilution; gift of Jeff Engler) or were stained with rabbit antiserum to E40RF3 (1:250 dilution; 
gift of Gary Ketner). Secondary antibodies (Cappel/ICN) were used at 1 :50 dilution. All 
30 antibodies were diluted in PBS containing 1 % BS A and 0. 1 % sodium azide. Photographs 

were taken on a Nikon epifluorescence microscope using a 100X Planapo lens and Tmax 400 
film (Kodak). The film was developed in Diafine developer. 

Analysis of viral DNA replication by Southern hybridization was performed as 
follows. H441 and Hep 3B cells were grown in 60 mm dishes in DMEM supplemented with 
35 10% FBS. Cells were infected at 70% confluence with 10 PFU/cell of KD1 or KD1-SPB. 
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Dishes were incubated in humidified 5% CO, atmosphere at 37°C. Total genomic DNAs 
were isolated at 5, 24, 48, 72, and 96 h p.i. Equal amounts of total genomic DNAs were 
digested with HindlH and resolved on a 1% agarose gel prior to transfer onto membranes. A 
random primer 32 P-labeled pBHGlO plasmid probe (Bert et al., Proc. Natl. Acad. Set. USA 
5 97:8802-8806, 1994) was used for hybridization, and the blots were autoradiographed. DNA 
fragments were quantitated on a Molecular Dynamics Phosphorlmager. 

Virus yields were determined as follows. Hep 3B cells or H441 cells grown as 
monolayers in 35 mm dishes were infected with 10 PFU/cell of KD1 or KD1-SPB. At days 0 
to 4 (for H441) or days 0 to 9 (for Hep 3B) p.i., cells and culture medium were frozen at - 
1 0 70°C. Samples were frozen and thawed three times to release the virus from the cells, and 
total virus yields were determined by plaque assay on A549 monolayers. 

The effect of KD1-SPB and KD1 on H441 and Hep 3B tumors was examined in a 
nude mouse model (Doronin et al., /. Virol. 74:6147-6155,2000). Tumor cells (10 7 cells in 
200 ul of DMEM, 50% Matrigel [Becton Dickinson Labware, Bedford, MA] for H441 cells, 
15 or 10 7 cells in 200 ul of DMEM plus 10% Matrigel for Hep 3B cells) were injected into flanks 
of 5-6 weeks old athymic nude mice and allowed to grow for three weeks to about 100 ul 
(H441) or 150 ul (Hep 3B) volumes. Pre-established tumors (n = 10) were injected with 50 ul 
of DMEM or 5 x 10 7 PFU of indicated viruses in DMEM. Injections of the viruses were 
repeated twice weekly for 3 weeks to the total dose of 3.0 x 10 8 PFU per tumor. Tumor size 
20 measurements were taken twice per week for H441 cells, or weekly for Hep 3B cells using a 
Sylvac digital caliper. Tumor volumes were calculated in according to the formula: length x 
width 2 12. Data are represented as means of increase in tumor size relative to the tumor size at 
the initial injection. 
Results 

25 The properties of KD1 -SPB in various cell types were compared to those of its 

"parent", KD1. Figure 25 shows the plaque development properties of these vectors on 293- 
E4, 293, and A549 cells. The data are plotted as the number of plaques seen on any day of 
the plaque assay as a percentage of the number of plaques seen at the end of the assay (i.e. 
when new plaques cease to appear) (Tollefson et al., J. Firo/. 70:2296-2306, 1966). This 

30 assay is an indicator of the size of the plaques. KD 1 formed plaques equally well on 293-E4 
and 293 cells (Figure 25A). With KD1-SPB, plaques were observed about 3-4 days sooner on 
293-E4 compared to 293 cells (Fig. 2A). On A549 cells, KD1 formed plaques 4-6 days 
sooner than KD1-SPB (Figure 25B). 

The properties of KD1-SPB versus KD1 were characterized in detail in H441 cells, a 

35 human papillary lung adenocarcinoma cell line known to express the TTF1 transcription 
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factor and in which the SPB promoter is active (Yan et al., /. Biol. Chem. 270:24852-24857, 
1995). Hep 3B cells, a human hepatocellular carcinoma in which the SPB promoter should 
not be active, were used as a negative control. H441 and Hep 3B monolayers were infected 
with 10 PFU/cell of KD1 or KD1-SPB and photographed at 4 and 7 days p.i. Mock-infected 
5 Hep 3B cells formed a relatively homogeneous monolayer, but H441 cells tended to form 
structures that resemble syncytia (Figure 26A, B). As expected, KD1 produced cytopathic 
effect (CPE) on both cell lines at 4 and 7 days p.i. (Figure 26A, B). Also as expected, KD1- 
SPB caused CPE on H441 cells but not on Hep 3B cells. Since CPE in Ad-infected cells is 
usually an indicator of virus growth, these results suggest that KD1-SPB grows in H441 but 
10 not in Hep 3B cells. 

To examine viral DNA replication, H441 and Hep 3B cells were infected with 10 
PFU/cell of KD1 or KD1-SPB, then the accumulation of viral DNA was determined by DNA 
blot. With H441 cells, KD1 and KD1-SPB DNAs were readily detected at similar levels at 
48-96 h p.i. (Figure 27A). With Hep 3B cells, KD1 DNA levels were similar to those in 
15 H441 cells, but KD1-SPB DNA was barely detectable. This was confirmed by 
Phosphorlmager analysis of the DNA bands (Figure 27B). 

Growth of KDI-SPB and KD1 in H441 and Hep 3B cells was determined by a single 
step growth assay. Cells were infected with 10 PFU/cell of vector, then total vector yield was 
determined by plaque assay. Total yield of both vectors was similar in H441 cells, reaching a 
20 plateau after 2 days (Fig. 28A). KD1 yield plateaued in Hep 3B cells after 2-4 days p.i. 

(Figure 28B). However, KDI-SPB levels were about 5 logs lower in Hep 3B cells after 2-4 
days, and even by 9 days they had not achieved the levels of KD1 . We conclude that KDI- 
SPB grows with significant specificity on H441 versus Hep 3B cells. Further, KDI-SPB 
grows as well as KD1 on H441 cells, indicating that the E4 promoter deletion by itself does 
25 not significantly compromise the vector, and that the E4 promoter can be replaced by a tissue- 
specific promoter in a replication-competent vector. 

To obtain further details on the replication of KDI-SPB vs KD1 in H441 and Hep 3B 
cells, the expression of representative Ad proteins by KDI-SPB and KD1 was examined. 
H441 or Hep 3B cells were mock-infected or infected with 10 PFU/ml of KD1 or KDI-SPB, 
30 then at 24 h p.i. the proteins were extracted and the El A, E40RF3, and ADP proteins were 
examined by immunoblot. E40RF3 is one of the six proteins coded by the E4 transcription 
unit(Leppard,y. Gen. Virol. 75:2131-2138, 1997). As anticipated, KDI-SPB expressed 
E40RF3 well in H441 cells, but only at trace levels in Hep 3B cells (Figure 29). KDI-SPB 
expressed the E1A proteins in Hep 3B cells. Synthesis of E1A proteins by KDI-SPB in Hep 
35 3B cells is expected because El A expression does not require E4 proteins; it also indicates 
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that the block to infection with KD1-SPB is downstream of E1A. KD1 expressed E1A in 
both cell lines, but the amount was less than obtained with KD1-SPB in Hep 3B cells (Figure 
29). The increased E1A levels seen with KD1-SPB may reflect its poor ability to enter the 
late phase of infection (see Discussion). KD1-SPB expressed ADP as well as KD1 in H441 
5 cells, but it did not make detectable ADP in Hep 3B cells. ADP is primarily a late protein, so 
this result is consistent with the relative lack of E4 protein expression, DNA replication, and 
growth of KD1-SPB in Hep 3B cells. 

To gain insights into replication events that occur in individual cells, expression of 
E40RF3, the E2A-DBP, and the fiber late protein was examined by immunofluorescence. 

10 H441 or Hep 3B cells were infected with 20 PFU/cell. At 48 h or 6 days p.i., cells were fixed 
and immunostained. E40RF3 was detected in the nuclei of H441 cells at 48 h p.i. with KD1, 
KD1-SPB, or dl309 (Figure 30A). (dl309 is an Ad5 mutant that has wild-type El A, expresses 
Ad5 levels of ADP, and lacks the E3-RID and E3-14.7K genes). E40RF3 could not be 
detected in the vast majority of Hep 3B cells infected with KD1-SPB (Figure 30A), even at 6 

1 5 days p.i . (Figure 30B). Thus, KD 1 -SPB expresses E40RF3 well in H44 1 but not in Hep 3B 
cells. 

Figure 31A shows double label immunofluorescence of DBP and fiber in the same 
Hep 3B cells at 48 h p.i. with KD1 or KD1-SPB. With KD1, there was a strong speckled 
staining pattern in the nucleus that is typical for DBP at 48 h p.i. (Figure 31 A, top left panel). 

20 There was strong staining of fiber throughout these same cells (Figure 3 1 A top right panel). 
Staining of the cytoplasm and nucleus is expected because fiber is synthesized in the 
cytoplasm and then transported to the nucleus where virions assemble. With KD1-SPB at 48 
h p.i., about 25% of the cells showed the speckled staining for DBP, and only one cell (7% of 
total) with the advanced speckled pattern was also stained for fiber (Figure 31 A, bottom two 

25 panels). Even at 6 days p.i., only about 30% of cells showed staining for DBP, and about 
20% for fiber (Figure 3 IB). Thus, markedly fewer Hep 3B cells infected with KD1-SPB 
expressed DBP and especially fiber as compared to KD1 . These results indicate that KD1- 
SPB replicates as well as KD1 in H441 cells, no doubt because the SPB promoter is active in 
H441 cells (Yan et al., J. Biol. Chem. 270:24852-24857, 1995). KD1-SPB barely replicates 

30 in Hep 3B cells, presumably because the SPB promoter is minimally active in these cells. 

At the culmination of replication, Ad-infected cells are lysed and the virus spreads to 
other cells; this process is mediated in large part by ADP (Tollefson et al., Virology 220:152- 
162, 1996; Tollefson etal.,7. Virol. 70:2296-2306, 1996). To examine vector-induced cell 
lysis, H441 and Hep 3B cells were mock-infected or infected with 20 PFU/cell of KD1, KD1- 

35 SPB, or dl309, and cell lysis was determined by release of lactate dehydrogenase (Tollefson et 
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al.,J. Virol. 70:2296-2306, 1996). All vectors lysed H441 cells beginning at 2-3 days p.i. 
(Figure 32A). KD1 and dl309 also lysed Hep 3B cells in the same time period; however, 
KD1-SPB caused only minimal cell lysis (Figure 9B). Thus, these data, along with the cell 
spread data in Example 6 and Figure 13, demonstrate that KD1-SPB lyses cells and spreads 
5 efficiently from cell-to-cell in H441 but not Hep 3B cells. 

An experiment was conducted to determine whether KD1-SPB or KD1 would 
suppress H441 tumors in nude mice. H441 cells were injected into each hind flank. When 
tumors had grown to about 100 ul (H441) or 150 pi (Hep 3B), they were injected twice 
weekly for 3 weeks with DMEM (mock) or 5 x 10 7 PFU of test virus in 50 pi of DMEM (3.0 

10 x 10 8 total PFU). Ten tumors (5 mice) were used for each virus. Growth of H441 tumors was 
suppressed similarly by KD1-SPB and KDI (Figure 33A). KD1 suppressed growth of Hep 
3B tumors, whereas KD1-SPB caused only minimal suppression (Figure 33B). These results 
show that KD1-SPB is as effective as KDI in suppressing tumors when the SPB promoter is 
active. Further, the cell type specificity observed with KDI -SPB in vitro is maintained in 

15 vivo. 

Discussion 

Tumor specificity is one of the biggest challenges facing cancer gene therapy, i.e. 
having the therapeutic gene be expressed specifically in cancer cells. Specificity is very 
important for RC viruses. Two main strategies have been described that in theory confer 

20 specificity: transductional targeting and transcriptional targeting. Directing specificity of 
vectors toward specific cell surface receptors on the target cells has been attempted through 
various methods. Although this approach is theoretically attractive it might encounter 
multiple obstacles such as the lack of incorporation of the engineered protein into the virion 
(Scaria et al., Virology 797:743-753, 1992) or lack of infectivity through the targeted receptor 

25 (Cosset et al., J. Virol. 69:61 14-6322, 1995). Transcriptional targeting utilizes tumor and 

tissue specific promoters. In replication-defective vectors these regulatory sequences confine 
the expression of cytotoxic genes to specific tissues. In replication-competent vectors, as an 
added layer of regulation, vector replication per se can be placed under the control of tumor or 
tissue specific promoter/enhancer sequences. In replication-competent Ad, insertion of the 

30 tissue or tumor specific promoter/enhancer into the E1A promoter/enhancer region has been 
used exclusively (Hallenbeck et al., Hum. Gene Ther. 70:1721-1733, 1999; Rodriguez et al., 
Cancer Res. 57:2559-2563, 1997; Yu et al., Cancer Res. 59, 4200-4203, 1999; Yu etal., 
Cancer Res. 59: 1498-1504, 1999). The rationale behind these vectors is that expression of 
E1A and therefore the whole Ad transcription program will depend on these tissue or tumor 

35 specific promoters. However, as a generic approach, there may be difficulties. The E1A 
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enhancer/promoter is very complex. The enhancer controls not only the El A promoter but 
also distant promoters such as the E4 promoter (Shenk, T. pp. 2 1 1 1-2148 In B.N. Fields, 
D.M. Knipe, and P.M. Howley (eds.), Fields Virology, Lippincott-Raven, Philadelphia, 
1996). In addition, it has been shown that the E1A enhancer in the inverted terminal repeat 
5 region changes tissue specificity of cellular promoters (Shi et al., Hum. Gene Ther. 8:403- 
410, 1997). Also, the El A enhancer/promoter is partially embedded within the signals 
required to package the Ad genome into virions, and it may be problematic to remove all the 
El A enhancer elements without impairing virus production. Accordingly, we chose to 
replace the E4 promoter with a tissue specific promoter. E4 genes are essential for Ad 
1 0 replication, and therefore we expected that the replication of the recombinant virus would be 
dependent on the tissue specific regulatory elements. 

To construct KD1-SPB, the ca. 300 bp of the E4 promoter was deleted and the B-500 
version (ca. 500 bp) of SPB promoter was inserted (Yan et al., supra) (Figure 24 C, D). We 
selected the SPB promoter because of its strict tissue specificity: it is exclusively active in 
15 type II alveolar cells and bronchial epithelial cells of the lung (Bohinski et al., 1 994, Mol. 
Cell. Biol. 7*5671-5681, 1994). Since the parental virus KD1 contains and expresses two 
El A mutations that restrict virus replication to tumor cells (Doronin et al., supra), we 
anticipated that the virus would selectively replicate in cells derived from lung tumors. Thus, 
H441 cells, a papillary lung carcinoma cell line, were used to characterize the replication, 
20 gene expression, and functional profile of KD1-SPB. 

KD1-SPB formed plaques 3-4 days sooner on 293-E4 cells that express E4 proteins 
than on 293 cells, whereas KDl formed plaques with the same kinetics on both cell lines. 
These data show that the E4 promoter is active in 293 cells, and that the SPB promoter 
displays very low activity in 293 cells. It is not clear why KD1-SPB forms plaques on 293 
25 cells; these cells are derived from human embryonic kidney and at least one of the 

transcription factors regulating the SPB promoter (Bohinski et al., supra), hepatocyte nuclear 
factor 3, is expressed in embryonic kidney. It is also possible that TTF1, the master 
regulatory factor of SPB expression, is minimally active in 293 cells. 

KDl grew to equally high titers in H441 and Hep 3B cells (Figure 28A B). In 
30 contrast, KD1-SPB replicated as efficiently as KDl in H441 cells, in which the SPB promoter 
is active (Yan et al., supra) (Figure 28A), but replicated poorly in Hep 3B cells, most likely 
because the SPB promoter is inactive (Figure 28B). This selectivity has been confirmed by 
measuring viral DNA production in the two cell lines. KD1-SPB DNA replication was 
similar both kinetically and quantitatively to KDl DNA replication in H441, however in Hep 
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3B cells, KD1-SPB DNA was almost undetectable (Figure 27 A, B). The cytopathic effect, a 
surrogate marker of Ad replication, showed a similar specificity (Figure 26). 

To further confirm our predictions on the molecular basis of the observed issue 
specificity we monitored viral protein expression. When cells were infected with KD1-SPB 
5 all the viral proteins early or late, except for El A, were expressed in a tissue-specific fashion 
(high expression in H441, low to undetectable expression in Hep 3B) (Figures 29-31). We 
found a good correlation between the levels of E4 promoter activity (E40RF3 expression) 
and the expression of E2A-DBP, ADP, and fiber proteins. Thus, the SPB promoter retains its 
tissue specificity in the Ad genome and it seems to be the limiting factor of Ad gene 

1 0 expression in the cell lines tested. As expected, expression of El A is not tissue-specific. 
Thus, the regulatory step of tissue-specific Ad DNA replication is downstream of E1A. In 
Hep 3B cells, KD1-SPB expressed El A at a higher level than did KD1 (Figure 29), strongly 
suggesting that KD1-SPB replication in most of the Hep3B cells remains at the early stage. 
The cytolytic effect of KD1-SPB also showed a tissue-specific profile (Figure 32; 

1 5 Figure 13 of Example 6), i.e., preferential lysis of H441 cells over Hep 3B cells, a pattern 
similar to the specificity observed at the level of DNA replication (Figure 27) and viral 
protein synthesis (Figures 29-3 1). This cell type specificity was also observed when these 
cells were growing as tumors in nude mice. Growth of H441 tumors was suppressed by KD1- 
SPB and KD1 at similar efficacy (Figure 33 A). In contrast, KD1-SPB unlike KD1 had only 

20 minimal effect on the growth of Hep 3B tumors (Figure 33B). 

In summary, substitution of the E4 promoter with a tissue specific promoter allows 
highly tissue specific replication of Ad vectors and in the target tissue it is as efficient as the 
replication of the parental virus. KD1-SPB lacks all E3 genes except ADP. E3 gpl9K, RID 
and 14.7K have been shown to protect Ad-infected cells from attack by cytotoxic 

25 lymphocytes and apoptosis-inducing cytokines such as tumor necrosis factor and Fas ligand 
(Wold et al, pp. 200-232 In A.J. Cann (ed.), DNA Virus Replication: Frontiers in Molecular 
Biology, Oxford University Press, Oxford, 2000; Wold et al., Curr. Opin. Immunol. 11:380- 
386, 1999). 

The therapeutic index (virus produced in H441 cells compared to Hep 3B cells) of 
30 KD1-SPB is 10 4 -10 5 for the first 4-5 days (Figure 28). These data compare to data reported 
by Calydon (lO'-lO 5 ) for their prostate specific viruses (Rodriguez et al., supra; Yu et al., 
Cancer Res. 59, 4200-4203, 1999; Yu et al., Cancer Res. 59:1498-1504, 1999). We suggest 
that KD1-SPB has some added advantage over vectors reported by other laboratories because 
it encodes a mutant form of El A that restricts replication to cancer cells (Doronin et al., 
35 supra). 



Copied from 10206747 on 07-04-2004 



WO 01/04282 



PCT/US00/18971 



50 

Although the lung ranks as the second highest cancer site for both men and women in 
the U.S. Reis et al., Cancer Res. 55:2398-2424, 2000), lung cancer has not been a major target 
for cancer vector gene therapy since intratumoral injection of virus is generally not feasible in 
the lungs. However, there has been a recent report of intratumor injection of a replication- 
5 defective Ad vector into a lung tumor, and such an approach could be attempted with KD1- 
SPB. It may also be feasible to administer KD1-SPB systemically in the lung. 

In view of the above, it will be seen that the several advantages of the invention 
are achieved and other advantageous results attained. 

As various changes could be made in the above methods and compositions 
10 without departing from the scope of the invention, it is intended that all matter contained in 
the above description and shown in the accompanying drawings shall be interpreted as 
illustrative and not in a limiting sense. 

All references cited in this specification, including patents and patent 
applications, are hereby incorporated by reference. The discussion of references herein is 
1 5 intended merely to summarize the assertions made by their authors and no admission is made 
that any reference constitutes prior art. Applicants reserve the right to challenge the accuracy 
and pertinence of the cited references. 
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What is Claimed Is : 

1 . A recombinant vector which is replication-competent in a neoplastic cell and 
which overexpresses an adenovirus death protein. 

2. The recombinant vector of claim 1 wherein the adenovirus death protein 
comprises amino acids 1-26, 41-59, and 63-70 of SEQ ID NO:5, SEQ ID NO:6, SEQ ID 
NO:7, or SEQ ID NO:8 or a conservatively substituted variant thereof or wherein the 
adenovirus death protein comprises SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, or SEQ ID 

5 NO:8. 

3. The recombinant vector of claim 2 which comprises a recombinant virus. 

4. The recombinant vector of claim 3, wherein the recombinant virus is an 
adenovirus lacking expression of at least one E3 protein selected from the group consisting of: 
gpl9K; RIDa; RIDP and 14.7K. 

5 . The recombinant vector of claim 4 which comprises SEQ ID NO:3 or SEQ 

ID NO:4. 

6. The recombinant vector of claim 3 which is replication-restricted to 
neoplastic cells. 

7. The recombinant vector of claim 6 which comprises SEQ ID NO: 1 or SEQ 

IDNO:2. 

8. The recombinant vector of claim 3, wherein the recombinant adenovirus 
comprises a tissue specific promoter, a tumor specific promoter, or an inducible promoter 
substituted for the E4 promoter. 

9. The recombinant vector of claim 8, wherein the tissue-specific promoter is a 
surfactant protein B promoter. 

10. The recombinant vector of claim 6 which comprises SEQ ID NO: 14, SEQ ID 
NO: 15 or SEQ ID NO: 16. 

1 1 . The recombinant vector of claim 1 , wherein the vector further comprises a 
gene encoding an anti-cancer product. 

12. The recombinant vector of claim 11, wherein the gene encoding an anti- 
cancer product is in the E3 region of the vector. 

13. A method for promoting death of a neoplastic cell comprising contacting the 
neoplastic cell with at least one vector which is replication competent in the neoplastic cell 
and which overexpresses an adenovirus death protein. 

14. The method of claim 13 wherein the adenovirus death protein comprises 
amino acids 1-26, 41-59, and 63-70 of SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, or SEQ 
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ID NO: 8 or a conservatively substituted variant thereof or wherein the adenovirus death 
protein comprises SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, or SEQ ID NO:8. 

15. The method of claim 14, wherein the vector comprises a recombinant 
adenovirus lacking expression of at least one E3 protein selected from the group consisting of: 
gpl9K; RIDcx; REDp and 14.7K. 

16. The method of claim 15, wherein the neoplastic cell comprises a tumor in a 
patient and the contacting step comprises administering the recombinant adenovirus to the 
tumor. 

17. The method of claim 16, further comprising the step of passively immunizing 
the patient against the recombinant adenovirus. 

18. The method of claim 17, wherein the recombinant adenovirus comprises SEQ 
IDNO:3orSEQIDNO:4. 

19. The method of claim 15, wherein the vector is replication-restricted to 
neoplastic cells. 

20. The method of claim 19, wherein the vector is a recombinant adenovirus 
comprising SEQ ID NO: 1 or SEQ ID NO:2. 

21. The method of claim 15, wherein the recombinant adenovirus comprises a 
tissue specific promoter or an inducible promoter substituted for the E4 promoter. 

22. The method of claim 21, wherein the tissue specific promoter is a surfactant 
protein B promoter. 

23. The method of claim 22, wherein the recombinant adenovirus comprises SEQ 
ID NO: 14, SEQ ID NO: 1 5 or SEQ ID NO: 1 6. 

24. The method of claim 16, further comprising treating the tumor with radiation. 

25. The method of claim 24, comprising administering more than one 
recombinant adenovirus to the tumor and treating the tumor with radiation. 

26. The method of claim 16, further comprising treating the tumor with 
chemotherapy. 

27. The method of claim 26, comprising administering more than one 
recombinant adenovirus to the tumor and treating the tumor with chemotherapy. 

28. The method of claim 16, further comprising administering to the tumor one 
or more replication-defective adenovirus which expresses an anti-cancer gene product, 
wherein the recombinant adenovirus complements spread of the replication-defective 
adenovirus in the tumor. 

29. A composition comprising: 
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a first recombinant virus which is replication competent in a neoplastic cell and 
overexpresses an adenovirus death protein; and 

a second recombinant virus which is replication defective and which expresses an 
5 anti-cancer gene product, 

wherein the first recombinant virus complements replication of the second recombinant virus. 

30. The composition of claim 29 wherein the first recombinant virus comprises a 
recombinant adenovirus lacking expression of at least one E3 protein selected from the group 
consisting of: gpl9K; RIDa; RTOP and 14.7K. 

31. The composition of claim 30 wherein the recombinant adenovirus comprises 
a nucleotide sequence selected from the group consisting of: SEQ ID NO:l; SEQ ID NO:2; 
SEQ ID NO: 14; SEQ ID NO: 15; SEQ ID NO: 16; SEQ ID NO:3; or SEQ ID NO:4. 

32. A composition comprising 

a first recombinant virus which is replication-defective in a neoplastic cell 
and which overexpresses an adenovirus death protein, and 

a second recombinant virus which is replication-competent in a neoplastic 

cell. 
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6 Ad2 VCLIIMWLIC CLKRRRARPP IYRPIIVLNP HNEKIHRLDG LKPCSLLLQY D 

7 Ad5 VCLIIMWLIC CLKRKRARPP IYSPIIVLHP NNDGIHRLDG LKHMFFSLTV - 
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Seq. ID No. 

17 aa 1-40 of Ad2 ADP MTGSTIAPTT DYRNTTATGL TSALNLPQVH AFVNDWASLD 

18 aa 41-59 of Ad2 ADP MWWFSIALMF VCLIIMWLI 

19 aa 63-70 of Ad2 ADP KRRRARPP 

20 aa 60-101 of Ad2 ADP C CLKRRRARPP IYRPIIVLNP HNEKIHRLDG LKPCSLLLQY D 
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LOCUS ad5 comple 3S935 bp DNA SYN 06-FEB-1999 

DEFINITION ad5 complete genome 
ACCESSION ad5 comple 
KEYWORDS 

SOURCE Unknown . 

ORGANISM Unknown 

Unclassified. 
REFERENCE 1 (bases I to 35935) 

AUTHORS Self 

JOURNAL Unpublished. 
BASE COUNT 8367 a 10073 C 9761 g 7734 t 

ORIGIN 

1 CATCATCAAT AATATACCTT ATTTTGGATT GAAGCCAATA TGATAATGAG GGGGTGGAGT 
61 TTGTGACGTG GCGCGGGGCG TGGGAACGGG GCGGGTGACG TAGTAGTGTG GCGGAAGTGT 
121 GATGTTGCAA GTGTGGCGGA ACACATGTAA GCGACGGATG TGGCAAAAGT GACGTTTTTG 
181 GTGTGCGCCG GTGTACACAG GAAGTGACAA TTTTCGCGCG GTTTTAGGCG GATGTTGTAG 
241 TAAATTTGGG CGTAACCGAG TAAGATTTGG CCATTTTCGC GGGAAAACTG AATAAGAGGA 
301 AGTGAAATCT GAATAATTTT GTGTTACTCA TAGCGCGTAA TATTTGTCTA GGGCCGCGGG 
361 GACTTTGACC GTTTACGTGG AGACTCGCCC AGGTGTTTTT CTCAGGTGTT TTCCGCGTTC 
421 CGGGTCAAAG TTGGCGTTTT ATTATTATAG TCAGCTGACG TGTAGTGTAT TTATACCCGG 
481 TGAGTTCCTC AAGAGGCCAC TCTTGAGTGC CAGCGAGTAG AGTTTTCTCC TCCGAGCCGC 
541 TCCGACACCG GGACTGAAAA TGAGACATAT TATCTGCCAC GGAGGTGTTA TTACCGAAGA 
601 AATGGCCGCC AGTCTTTTGG ACCAGCTGAT CGAAGAGGTA CTGGCTGATA ATCTTCCACC 
661 TCCTAGCCAT TTTGAACCAC CTACCCTTCA CGAACTGTAT GATTTAGACG TGACGGCCCC 
721 CGAAGATCCC AACGAGGAGG CGGTTTCGCA GATTTTTCCC GACTCTGTAA TGTTGGCGGT 
781 GCAGGAAGGG ATTGACTTAC TCACTTTTCC GCCGGCGCCC GGTTCTCCGG AGCCGCCTCA 
841 CCTTTCCCGG CAGCCCGAGC AGCCGGAGCA GAGAGCCTTG GGTCCGGTTT CTATGCCAAA 
901 CCTTGTACCG GAGGTGATCG ATCTTACCTG CCACGAGGCT GGCTTTCCAC CCAGTGACGA 
961 CGAGGATGAA GAGGGTGAGG AGTTTGTGTT AGATTATGTG GAGCACCCCG GGCACGGTTG 
1021 CAGGTCTTGT CATTATCACC GGAGGAATAC GGGGGACCCA GATATTATGT GTTCGCTTTG 
1081 CTATATGAGG ACCTGTGGCA TGTTTGTCTA CAGTAAGTGA AAATTATGGG CAGTGGGTGA 
1141 TAGAGTGGTG GGTTTGGTGT GGTAATTTTT TTTTTAATTT TTACAGTTTT GTGGTTTAAA 
1201 GAATTTTGTA TTGTGATTTT TTTAAAAGGT CCTGTGTCTG AACCTGAGCC TGAGCCCGAG 
1261 CCAGAACCGG AGCCTGCAAG ACCTACCCGC CGTCCTAAAA TGGCGCCTGC TATCCTGAGA 
1321 CGCCCGACAT CACCTGTGTC TAGAGAATGC AATAGTAGTA CGGATAGCTG TGACTCCGGT 
1381 CCTTCTAACA CACCTCCTGA GATACACCCG GTGGTCCCGC TGTGCCCCAT TAAACCAGTT 
1441 GCCGTGAGAG TTGGTGGGCG TCGCCAGGCT GTGGAATGTA TCGAGGACTT GCTTAACGAG 
1501 CCTGGGCAAC CTTTGGACTT GAGCTGTAAA CGCCCCAGGC CATAAGGTGT AAACCTGTGA 
1561 TTGCGTGTGT GGTTAACGCC TTTGTTTGCT GAATGAGTTG ATGTAAGTTT AATAAAGGGT 
1621 GAGATAATGT TTAACTTGCA TGGCGTGTTA AATGGGGCGG GGCTTAAAGG GTATATAATG 
1681 CGCCGTGGGC TAATCTTGGT TACATCTGAC CTCATGGAGG CTTGGGAGTG TTTGGAAGAT 
1741 TTTTCTGCTG TGCGTAACTT GCTGGAACAG AGCTCTAACA GTACCTCTTG GTTTTGGAGG 
1801 TTTCTGTGGG GCTCATCCCA GGCAAAGTTA GTCTGCAGAA TTAAGGAGGA TTACAAGTGG 
1861 GAATTTGAAG AGCTTTTGAA ATCCTGTGGT GAGCTGTTTG ATTCTTTGAA TCTGGGTCAC 
1921 CAGGCGCTTT TCCAAGAGAA GGTCATCAAG ACTTTGGATT TTTCCACACC GGGGCGCGCT 
1981 GCGGCTGCTG TTGCTTTTTT GAGTTTTATA AAGGATAAAT GGAGCGAAGA AACCCATCTG 
2041 AGCGGGGGGT ACCTGCTGGA TTTTCTGGCC ATGCATCTGT GGAGAGCGGT TGTGAGACAC 
2101 AAGAATCGCC TGCTACTGTT GTCTTCCGTC CGCCCGGCGA TAATACCGAC GGAGGAGCAG 
2161 CAGCAGCAGC AGGAGGAAGC CAGGCGGCGG CGGCAGGAGC AGAGCCCATG GAACCCGAGA 
2221 GCCGGCCTGG ACCCTCGGGA ATGAATGTTG TACAGGTGGC TGAACTGTAT CCAGAACTGA 
2281 GACGCATTTT GACAATTACA GAGGATGGGC AGGGGCTAAA GGGGGTAAAG AGGGAGCGGG 
2341 GGGCTTGTGA GGCTACAGAG GAGGCTAGGA ATCTAGCTTT TAGCTTAATG ACCAGACACC 
2401 GTCCTGAGTG TATTACTTTT CAACAGATCA AGGATAATTG CGCTAATGAG CTTGATCTGC 
2461 TGGCGCAGAA GTATTCCATA GAGCAGCTGA CCACTTACTG GCTGCAGCCA GGGGATGATT 
2521 TTGAGGAGGC TATTAGGGTA TATGCAAAGG TGGCACTTAG GCCAGATTGC AAGTACAAGA 
2581 TCAGCAAACT TGTAAATATC AGGAATTGTT GCTACATTTC TGGGAACGGG GCCGAGGTGG 
2641 AGATAGATAC GGAGGATAGG GTGGCCTTTA GATGTAGCAT GATAAATATG TGGCCGGGGG 
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2701 TGCTTGGCAT GGACGGGGTG GTTATTATGA ATGTAAGGTT TACTGGCCCC AATTTTAGCG 
2761 GTACGGTTTT CCTGGCCAAT ACCAACCTTA TCCTACACGG TGTAAGCTTC TATGGGTTTA 
2821 ACAATACCTG TGTGGAAGCC TGGACCGATG TAAGGGTTCG GGGCTGTGCC TTTTACTGCT 
2881 GCTGGAAGGG GGTGGTGTGT CGCCCCAAAA GCAGGGCTTC AATTAAGAAA TGCCTCTTTG 
2941 AAAGGTGTAC CTTGGGTATC CTGTCTGAGG GTAACTCCAG GGTGCGCCAC AATGTGGCCT 
3001 CCGACTGTGG TTGCTTCATG CTAGTGAAAA GCGTGGCTGT GATTAAGCAT AACATGGTAT 
3061 GTGGCAACTG CGAGGACAGG GCCTCTCAGA TGCTGACCTG CTCGGACGGC AACTGTCACC 
3121 TGCTGAAGAC CATTCACGTA GCCAGCCACT CTCGCAAGGC CTGGCCAGTG TTTGAGCATA 
3181 ACATACTGAC CCGCTGTTCC TTGCATTTGG GTAACAGGAG GGGGGTGTTC CTACCTTACC 
3241 AATGCAATTT GAGTCACACT AAGATATTGC TTGAGCCCGA GAGCATGTCC AAGGTGAACC 
3301 TGAACGGGGT GTTTGACATG ACCATGAAGA TCTGGAAGGT GCTGAGGTAC GATGAGACCC 
3361 GCACCAGGTG CAGACCCTGC GAGTGTGGCG GTAAACATAT TAGGAACCAG CCTGTGATGC 
3421 TGGATGTGAC CGAGGAGCTG AGGCCCGATC ACTTGGTGCT GGCCTGCACC CGCGCTGAGT 
3481 TTGGCTCTAG CGATGAAGAT ACAGATTGAG GTACTGAAAT GTGTGGGCGT GGCTTAAGGG 
3541 TGGGAAAGAA TATATAAGGT GGGGGTCTTA TGTAGTTTTG TATCTGTTTT GCAGCAGCCG 
3601 CCGCCGCCAT GAGCACCAAC TCGTTTGATG GAAGCATTGT GAGCTCATAT TTGACAACGC 
3661 GCATGCCCCC ATGGGCCGGG GTGCGTCAGA ATGTGATGGG CTCCAGCATT GATGGTCGCC 
3721 CCGTCCTGCC CGCAAACTCT ACTACCTTGA CCTACGAGAC CGTGTCTGGA ACGCCGTTGG 
3781 AGACTGCAGC CTCCGCCGCC GCTTCAGCCG CTGCAGCCAC CGCCCGCGGG ATTGTGACTG 
3841 ACTTTGCTTT CCTGAGCCCG CTTGCAAGCA GTGCAGCTTC CCGTTCATCC GCCCGCGATG 
3901 ACAAGTTGAC GGCTCTTTTG GCACAATTGG ATTCTTTGAC CCGGGAACTT AATGTCGTTT 
3961 CTCAGCAGCT GTTGGATCTG CGCCAGCAGG TTTCTGCCCT GAAGGCTTCC TCCCCTCCCA 
4021 ATGCGGTTTA AAACATAAAT AAAAAACCAG ACTCTGTTTG GATTTGGATC AAGCAAGTGT 
4081 CTTGCTGTCT TTATTTAGGG GTTTTGCGCG CGCGGTAGGC CCGGGACCAG CGGTCTCGGT 
4141 CGTTGAGGGT CCTGTGTATT TTTTCCAGGA CGTGGTAAAG GTGACTCTGG ATGTTCAGAT 
4201 ACATGGGCAT AAGCCCGTCT CTGGGGTGGA GGTAGCACCA CTGCAGAGCT TCATGCTGCG 
4261 GGGTGGTGTT GTAGATGATC CAGTCGTAGC AGGAGCGCTG GGCGTGGTGC CTAAAAATGT 
4321 CTTTCAGTAG CAAGCTGATT GCCAGGGGCA GGCCCTTGGT GTAAGTGTTT ACAAAGCGGT 
4381 TAAGCTGGGA TGGGTGCATA CGTGGGGATA TGAGATGCAT CTTGGACTGT ATTTTTAGGT 
4441 TGGCTATGTT CCCAGCCATA TCCCTCCGGG GATTCATGTT GTGCAGAACC ACCAGCACAG 
4501 TGTATCCGGT GCACTTGGGA AATTTGTCAT GTAGCTTAGA AGGAAATGCG TGGAAGAACT 
4561 TGGAGACGCC CTTGTGACCT CCAAGATTTT CCATGCATTC GTCCATAATG ATGGCAATGG 
4621 GCCCACGGGC GGCGGCCTGG GCGAAGATAT TTCTGGGATC ACTAACGTCA TAGTTGTGTT 
4681 CCAGGATGAG ATCGTCATAG GCCATTTTTA CAAAGCGCGG GCGGAGGGTG CCAGACTGCG 
4741 GTATAATGGT TCCATCCGGC CCAGGGGCGT AGTTACCCTC ACAGATTTGC ATTTCCCACG 
4801 CTTTGAGTTC AGATGGGGGG ATCATGTCTA CCTGCGGGGC GATGAAGAAA ACGGTTTCCG 
4861 GGGTAGGGGA GATCAGCTGG GAAGAAAGCA GGTTCCTGAG CAGCTGCGAC TTACCGCAGC 
4921 CGGTGGGCCC GTAAATCACA CCTATTACCG GGTGCAACTG GTAGTTAAGA GAGCTGCAGC 
4981 TGCCGTCATC CCTGAGCAGG GGGGCCACTT CGTTAAGCAT GTCCCTGACT CGCATGTTTT 
5041 CCCTGACCAA ATCCGCCAGA AGGCGCTCGC CGCCCAGCGA TAGCAGTTCT TGCAAGGAAG 
5101 CAAAGTTTTT CAACGGTTTG AGACCGTCCG CCGTAGGCAT GCTTTTGAGC GTTTGACCAA 
5161 GCAGTTCCAG GCGGTCCCAC AGCTCGGTCA CCTGCTCTAC GGCATCTCGA TCCAGCATAT 
5221 CTCCTCGTTT CGCGGGTTGG GGCGGCTTTC GCTGTACGGC AGTAGTCGGT GCTCGTCCAG 
5281 ACGGGCCAGG GTCATGTCTT TCCACGGGCG CAGGGTCCTC GTCAGCGTAG TCTGGGTCAC 
5341 GGTGAAGGGG TGCGCTCCGG GCTGCGCGCT GGCCAGGGTG CGCTTGAGGC TGGTCCTGCT 
5401 GGTGCTGAAG CGCTGCCGGT CTTCGCCCTG CGCGTCGGCC AGGTAGCATT TGACCATGGT 
5461 GTCATAGTCC AGCCCCTCCG CGGCGTGGCC CTTGGCGCGC AGCTTGCCCT TGGAGGAGGC 
5521 GCCGCACGAG GGGCAGTGCA GACTTTTGAG GGCGTAGAGC TTGGGCGCGA GAAATACCGA 
5581 TTCCGGGGAG TAGGCATCCG CGCCGCAGGC CCCGCAGACG GTCTCGCATT CCACGAGCCA 
5641 GGTGAGCTCT GGCCGTTCGG GGTCAAAAAC CAGGTTTCCC CCATGCTTTT TGATGCGTTT 
5701 CTTACCTCTG GTTTCCATGA GCCGGTGTCC ACGCTCGGTG ACGAAAAGGC TGTCCGTGTC 
5761 CCCGTATACA GACTTGAGAG GCCTGTCCTC GAGCGGTGTT CCGCGGTCCT CCTCGTATAG 
5821 AAACTCGGAC CACTCTGAGA CAAAGGCTCG CGTCCAGGCC AGCACGAAGG AGGCTAAGTG 
5881 GGAGGGGTAG CGGTCGTTGT CCACTAGGGG GTCCACTCGC TCCAGGGTGT GAAGACACAT 
5941 GTCGCCCTCT TCGGCATCAA GGAAGGTGAT TGGTTTGTAG GTGTAGGCCA CGTGACCGGG 
6001 TGTTCCTGAA GGGGGGCTAT AAAAGGGGGT GGGGGCGCGT TCGTCCTCAC TCTCTTCOGC 
6061 ATCGCTGTCT GCGAGGGCCA GCTGTTGGGG TGAGTACTCC CTCTGAAAAG CGGGCATGAC 
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6121 TTCTGCGCTA AGATTGTCAG TTTCCAAAAA CGAGGAGGAT TTGATATTCA CCTGGCCCGC 
6181 GGTGATGCCT TTGAGGGTGG CCGCATCCAT CTGGTCAGAA AAGACAATCT TTTTGTTGTC 
6241 AAGCTTGGTG GCAAACGACC CGTAGAGGGC GTTGGACAGC AACTTGGCGA TGGAGCGCAG 
6301 GGTTTGGTTT TTGTCGCGAT CGGCGCGCTC CTTGGCCGCG ATGTTTAGCT GCACGTATTC 
6361 GCGCGCAACG CACCGCCATT CGGGAAAGAC GGTGGTGCGC TCGTCGGGCA CCAGGTGCAC 
6421 GCGCCAACCG CGGTTGTGCA GGGTGACAAG GTCAACGCTG GTGGCTACCT CTCCGCGTAG 
6481 GCGCTCGTTG GTCCAGCAGA GGCGGCCGCC CTTGCGCGAG CAGAATGGCG GTAGGGGGTC 
6541 TAGCTGCGTC TCGTCCGGGG GGTCTGCGTC CACGGTAAAG ACCCCGGGCA GCAGGCGCGC 
6601 GTCGAAGTAG TCTATCTTGC ATCCTTGCAA GTCTAGCGCC TGCTGCCATG CGCGGGCGGC 
6661 AAGCGCGCGC TCGTATGGGT TGAGTGGGGG ACCCCATGGC ATGGGGTGGG TGAGCGCGGA 
6721 GGCGTACATG CCGCAAATGT CGTAAACGTA GAGGGGCTCT CTGAGTATTC CAAGATATGT 
6781 AGGGTAGCAT CTTCCACGGC GGATGCTGGC GCGCACGTAA TCGTATAGTT CGTGCGAGGG 
6841 AGCGAGGAGG TCGGGACCGA GGTTGCTACG GGCGGGCTGC TCTGCTCGGA AGACTATCTG 
6901 CCTGAAGATG GCATGTGAGT TGGATGATAT GGTTGGACGC TGGAAGACGT TGAAGCTGGC 
6961 GTCTGTGAGA CCTACCGCGT CACGCACGAA GGAGGCGTAG GAGTCGCGCA GCTTGTTGAC 
7021 CAGCTCGGCG GTGACCTGCA CGTCTAGGGC GCAGTAGTCC AGGGTTTCCT TGATGATGTC 
7081 ATACTTATCC TGTCCCTTTT TTTTCCACAG CTCGCGGTTG AGGACAAACT CTTCGCGGTC 
7141 TTTCCAGTAC TCTTGGATCG GAAACCCGTC GGCCTCCGAA CGGTAAGAGC CTAGCATGTA 
7201 GAACTGGTTG ACGGCCTGGT AGGCGCAGCA TCCCTTTTCT ACGGGTAGCG CGTATGCCTG 
7261 CGCGGCCTTC CGGAGCGAGG TGTGGGTGAG CGCAAAGGTG TCCCTGACCA TGACTTTGAG 
7321 GTACTGGTAT TTGAAGTCAG TGTCGTCGCA TCCGCCCTGC TCCCAGAGCA AAAAGTCCGT 
7381 GCGCTTTTTG GAACGCGGAT TTGGCAGGGC GAAGGTGACA TCGTTGAAGA GTATCTTTCC 
7441 CGCGCGAGGC ATAAAGTTGC GTGTGATGCG GAAGGGTCCC GGCACCTCGG AACGGTTGTT 
7501 AATTACCTGG GCGGCGAGCA CGATCTCGTC AAAGCCGTTG ATGTTGTGGC CCACAATGTA 
7561 AAGTTCCAAG AAGCGCGGGA TGCCCTTGAT GGAAGGCAAT TTTTTAAGTT CCTCGTAGGT 
7621 GAGCTCTTCA GGGGAGCTGA GCCCGTGCTC TGAAAGGGCC CAGTCTGCAA GATGAGGGTT 
7681 GGAAGCGACG AATGAGCTCC ACAGGTCACG GGCCATTAGC ATTTGCAGGT GGTCGCGAAA 
7741 GGTCCTAAAC TGGCGACCTA TGGCCATTTT TTCTGGGGTG ATGCAGTAGA AGGTAAGCGG 
7801 GTCTTGTTCC CAGCGGTCCC ATCCAAGGTT CGCGGCTAGG TCTCGCGCGG CAGTCACTAG 
7861 AGGCTCATCT CCGCCGAACT TCATGACCAG CATGAAGGGC ACGAGCTGCT TCCCAAAGGC 
7921 CCCCATCCAA GTATAGGTCT CTACATCGTA GGTGACAAAG AGACGCTCGG TGCGAGGATG 
7981 CGAGCCGATC GGGAAGAACT GGATCTCCCG CCACCAATTG GAG GAGTG GC TATTGATGTG 
8041 GTGAAAGTAG AAGTCCCTGC GACGGGCCGA ACACTCGTGC TGGCTTTTGT AAAAACGTGC 
8101 GCAGTACTGG CAGCGGTGCA CGGGCTGTAC ATCCTGCACG AGGTTGACCT GACGACCGCG 
8161 CACAAGGAAG CAGAGTGGGA ATTTGAGCCC CTCGCCTGGC GGGTTTGGCT GGTGGTCTTC 
8221 TACTTCGGCT GCTTGTCCTT GACCGTCTGG CTGCTCGAGG GGAGTTACGG TGGATCGGAC 
8281 CACCACGCCG CGCGAGCCCA AAGTCCAGAT GTCCGCGCGC GGCGGTCGGA GCTTGATGAC 
8341 AACATCGCGC AGATGGGAGC TGTCCATGGT CTGGAGCTCC CGCGGCGTCA GGTCAGGCGG 
8401 GAGCTCCTGC AGGTTTACCT CGCATAGACG GGTCAGGGCG CGGGCTAGAT CCAGGTGATA 
8461 CCTAATTTCC AGGGGCTGGT TGGTGGCGGC GTCGATGGCT TGCAAGAGGC CGCATCCCCG 
85 21 CGGCGCGACT ACGGTACCGC GCGGCGGGCG GTGGGCCGCG GGGGTGTCCT TGGATGATGC 
8581 ATCTAAAAGC GGTGACGCGG GCGAGCCCCC GGAGGTAGGG GGGGCTCCGG ACCCGCCGGG 
8641 AGAGGGGGCA GGGGCACGTC GGCGCCGCGC GCGGGCAGGA GCTGGTGCTG CGCGCGTAGG 
8701 TTGCTGGCGA ACGCGACGAC GCGGCGGTTG ATCTCCTGAA TCTGGCGCCT CTGCGTGAAG 
8761 ACGACGGGCC CGGTGAGCTT GAGCCTGAAA GAGAGTTCGA CAGAATCAAT TTCGGTGTCG 
8821 TTGACGGCGG CCTGGCGCAA AATCTCCTGC ACGTCTCCTG AGTTGTCTTG ATAGGCGATC 
8881 TCGGCCATGA ACTGCTCGAT CTCTTCCTCC TGGAGATCTC CGCGTCCGGC TCGCTCCACG 
8941 GTGGCGGCGA GGTCGTTGGA AATGCGGGCC ATGAGCTGCG AGAAGGCGTT GAGGCCTCCC 
9001 TCGTTCCAGA CGCGGCTGTA GACCACGCCC CCTTCGGCAT CGCGGGCGCG CATGACCACC 
9061 TGCGCGAGAT TGAGCTCCAC GTGCCGGGCG AAGACGGCGT AGTTTCGCAG GCGCTGAAAG 
9121 AGGTAGTTGA GGGTGGTGGC GGTGTGTTCT GCCACGAAGA AGTACATAAC CCAGCGTCGC 
9181 AACGTGGATT CGTTGATATC CCCCAAGGCC TCAAGGCGCT CCATGGCCTC GTAGAAGTCC 
9241 ACGGCGAAGT TGAAAAACTG GGAGTTGCGC GCCGACACGG TTAACTCCTC CTCCAGAAGA 
9301 CGGATGAGCT CGGCGACAGT GTCGCGCACC TCGCGCTCAA AGGCTACAGG GGCCTCTTCT 
9361 TCTTCTTCAA TCTCCTCTTC CATAAGGGCC TCCCCTTCTT CTTCTTCTGG CGGCGGTGGG 
9421 GGAGGGGGGA CACGGCGGCG ACGACGGCGC ACCGGGAGGC GGTCGACAAA GCGCTCOATC 
9481 ATCTCCCCGC GGCGACGGCG CATGGTCTCG GTGACGGCGC GGCCGTTCTC GCGGGGGCGC 
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9541 AGTTGGAAGA CGCCGCCCGT CATGTCCCGG TTATGGGTTG GCGGGGGGCT GCCATGCGGC 
9601 AGGGATACGG CGCTAACGAT GCATCTCAAC AATTGTTGTG TAGGTACTCC GCCGCCGAGG 
9661 GACCTGAGCG AGTCCGCATC GACCGGATCG GAAAACCTCT CGAGAAAGGC GTCTAACCAG 
9721 TCACAGTCGC AAGGTAGGCT GAGCACCGTG GCGGGCGGCA GCGGGCGGCG GTCGGGGTTG 
9781 TTTCTGGCGG AGGTGCTGCT GATGATGTAA TTAAAGTAGG CGGTCTTGAG ACGGCGGATG 
9841 GTCGACAGAA GCACCATGTC CTTGGGTCCG GCCTGCTGAA TGCGCAGGCG GTCG GCCA TG 
9901 CCCCAGGCTT CGTTTTGACA TCGGCGCAGG TCTTTGTAGT AGTCTTGCAT GAGCCTTTCT 
9961 ACCGGCACTT CTTCTTCTCC TTCCTCTTGT CCTGCATCTC TTGCATCTAT CGCTGCGGCG 
10021 GCGGCGGAGT TTGGCCGTAG GTGGCGCCCT CTTCCTCCCA TGCGTGTGAC CCCGAAGCCC 
10081 CTCATCGGCT GAAGCAGGGC TAGGTCGGCG ACAACGCGCT CGGCTAATAT GGCCTGCTGC 
10141 ACCTGCGTGA GGGTAGACTG GAAGTCATCC ATGTCCACAA AGCGGTGGTA TGCGCCCGTG 
10201 TTGATGGTGT AAGTGCAGTT GGCCATAACG GACCAGTTAA CGGTCTGGTG ACCCGGCTGC 
10261 GAGAGCTCGG TGTACCTGAG ACGCGAGTAA GCCCTCGAGT CAAATACGTA GTCGTTGCAA 
10321 GTCCGCACCA GGTACTGGTA TCCCACCAAA AAGTGCGGCG GCGGCTGGCG GTAGAGGGGC 
10381 CAGCGTAGGG TGGCCGGGGC TCCGGGGGCG AGATCTTCCA ACATAAGGCG ATGATATCCG 
10441 TAGATGTACC TGGACATCCA GGTGATGCCG GCGGCGGTGG TGGAGGCGCG CGGAAAGTCG 
10501 CGGACGCGGT TCCAGATGTT GCGCAGCGGC AAAAAGTGCT CCATGGTCGG GACGCTCTGG 
10561 CCGGTCAGGC GCGCGCAATC GTTGACGCTC TAGACCGTGC AAAAGGAGAG CCTGTAAGCG 
10621 GGCACTCTTC CGTGGTCTGG TGGATAAATT CGCAAGGGTA TCATGGCGGA CGACCGGGGT 
10681 TCGAGCCCCG TATCCGGCCG TCCGCCGTGA TCCATGCGGT TACCGCCCGC GTGTCGAACC 
10741 CAGGTGTGCG ACGTCAGACA ACGGGGGAGT GCTCCTTTTG GCTTCCTTCC AGGCGCGGCG 
10801 GCTGCTGCGC TAGCTTTTTT GGCCACTGGC CGCGCGCAGC GT AAGC GGTT AGGCTGGAAA 
10861 GCGAAAGCAT TAAGTGGCTC GCTCCCTGTA GCCGGAGGGT TATTTTCCAA GGGTTGAGTC 
10921 GCGGGACCCC CGGTTCGAGT CTCGGACCGG CCGGACTGCG GCGAACGGGG G TTTGCCT CC 
10981 CCGTCATGCA AGACCCCGCT TGCAAATTCC TCCGGAAACA GGGACGAGCC CCTTTTTTGC 
11041 TTTTCCCAGA TGCATCCGGT GCTGCGGCAG ATGCGCCCCC CTCCTCAGCA GCGGCAAGAG 
11101 CAAGAGCAGC GGCAGACATG CAGGGCACCC TCCCCTCCTC CTACCGCGTC AGGAGGGGCG 
11161 ACATCCGCGG TTGACGCGGC AGCAGATGGT GATTACGAAC CCCCGCGGCG CCGGGCCCGG 
11221 CACTACCTGG ACTTGGAGGA GGGCGAGGGC CTGGCGCGGC TAGGAGCGCC CTCTCCTGAG 
11281 CGGTACCCAA GGGTGCAGCT GAAGCGTGAT ACGCGTGAGG CGTACGTGCC GCGGCAGAAC 
11341 CTGTTTCGCG ACCGCGAGGG AGAGGAGCCC GAGGAGATGC GGGATCGAAA GTTCCA CG CA 
11401 GGGCGCGAGC TGCGGCATGG CCTGAATCGC GAGCGGTTGC TGCGCGAGGA GGACTTTGAG 
11461 CCCGACGCGC GAACCGGGAT TAGTCCCGCG CGCGCACACG TGGCGGCCGC CGACCTGGTA 
11521 ACCGCATACG AGCAGACGGT GAACCAGGAG ATTAACTTTC AAAAAAGCTT TAACAACCAC 
11581 GTGCGTACGC TTGTGGCGCG CGAGGAGGTG GCTATAGGAC TGATGCATCT GTGGGACTTT 
11641 GTAAGCGCGC TGGAGCAAAA CCCAAATAGC AAGCCGCTCA TGGCGCAGCT GTTCCTTATA 
11701 GTGCAGCACA GCAGGGACAA CGAGGCATTC AGGGATGCGC TGCTAAACAT AGTAGAGCCC 
11761 GAGGGCCGCT GGCTGCTCGA TTTGATAAAC ATCCTGCAGA GCATAGTGGT GCAGGAGCGC 
11821 AGCTTGAGCC TGGCTGACAA GGTGGCCGCC ATCAACTATT CCATGCTTAG CCTGGGCAAG 
11881 TTTTACGCCC GCAAGATATA CCATACCCCT TACGTTCCCA TAGACAAGGA GGTAAAGATC 
11941 GAGGGGTTCT ACATGCGCAT GGCGCTGAAG GTGCTTACCT TGAGCGACGA CCTGGGCGTT 
12001 TATCGCAACG AGCGCATCCA CAAGGCCGTG AGCGTGAGCC GGCGGCGCGA GCTCAGCGAC 
12061 CGCGAGCTGA TGCACAGCCT GCAAAGGGCC CTGGCTGGCA CGGGCAGCGG CGATAGAGAG 
12121 GCCGAGTCCT ACTTTGACGC GGGCGCTGAC CTGCGCTGGG CCCCAAGCCG ACGCGCCCTG 
12181 GAGGCAGCTG GGGCCGGACC TGGGCTGGCG GTGGCACCCG CGCGCGCTGG CAACGTCGGC 
12241 GGCGTGGAGG AATATGACGA GGACGATGAG TACGAGCCAG AGGACGGCGA GTACTAAGCG 
12301 GTGATGTTTC TGATCAGATG ATGCAAGACG CAACGGACCC GGCGGTGCGG GCGGCGCTGC 
12361 AGAGCCAGCC GTCCGGCCTT AACTCCACGG ACGACTGGCG CCAGGTCATG GACCGCATCA 
12421 TGTCGCTGAC TGCGCGCAAT CCTGACGCGT TCCGGCAGCA GCCGCAGGCC AACCGGCTCT 
12481 CCGCAATTCT GGAAGCGGTG GTCCCGGCGC GCGCAAACCC CACGCACGAG AAGGTGCTGG 
12541 CGATCGTAAA CGCGCTGGCC GAAAACAGGG CCATCCGGCC CGACGAGGCC GGCCTGGTCT 
12601 ACGACGCGCT GCTTCAGCGC GTGGCTCGTT ACAACAGCGG CAACGTGCAG ACCAACCTGG 
12661 ACCGGCTGGT GGGGGATGTG CGCGAGGCCG TGGCGCAGCG TGAGCGCGCG CAGCAGCAGG 
12721 GCAACCTGGG CTCCATGGTT GCACTAAACG CCTTCCTGAG TACACAGCCC GCCAACGTGC 
12781 CGCGGGGACA GGAGGACTAC ACCAACTTTG TGAGCGCACT GCGGCTAATG GTGACTGAGA 
12841 CACCGCAAAG TGAGGTGTAC CAGTCTGGGC CAGACTATTT TTTCCAGACC AGTAGACAAG 
12901 GCCTGCAGAC CGTAAACCTG AGCCAGGCTT TCAAAAACTT GCAGGGGCTG TGGGGGGTGC 
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12961 GGGCTCCCAC AGGCGACCGC GCGACCGTGT CTAGCTTGCT GACGCCCAAC TCGCGCCTGT 
13021 TGCTGCTGCT AATAGCGCCC TTCACGGACA GTGGCAGCGT GTCCCGGGAC ACATACCTAG 
L3081 GTCACTTGCT GACACTGTAC CGCGAGGCCA TAGGTCAGGC GCATGTGGAC GAGCATACTT 
13141 TCCAGGAGAT TACAAGTGTC AGCCGCGCGC TGGGGCAGGA GGACACGGGC AGCCTGGAGG 
13201 CAACCCTAAA CTACCTGCTG ACCAACCGGC GGCAGAAGAT CCCCTCGTTG CACAGTTTAA 
13261 ACAGCGAGGA GGAGCGCATT TTGCGCTACG TGCAGCAGAG CGTGAGCCTT AACCTGATGC 
L3321 GCGACGGGGT AACGCCCAGC GTGGCGCTGG ACATGACCGC GCGCAACATG GAACCGGGCA 
L3381 TGTATGCCTC AAACCGGCCG TTTATCAACC GCCTAATGGA CTACTTGCAT CGCGCGGCCG 
1.3441 CCGTGAACCC CGAGTATTTC ACCAATGCCA TCTTGAACCC GCACTGGCTA CCGCCCCCTG 
L3501 GTTTCTACAC CGGGGGATTC GAGGTGCCCG AGGGTAACGA TGGATTCCTC TGGGACGACA 
L3561 TAGACGACAG CGTGTTTTCC CCGCAACCGC AGACCCTGCT AGAGTTGCAA CAGCGCGAGC 
L3621 AGGCAGAGGC GGCGCTGCGA AAGGAAAGCT TCCGCAGGCC AAGCAGCTTG TCCGATCTAG 
13681 GCGCTGCGGC CCCGCGGTCA GATGCTAGTA GCCCATTTCC AAGCTTGATA GGGTCTCTTA 
13741 CCAGCACTCG CACCACCCGC CCGCGCCTGC TGGGCGAGGA GGAGTACCTA AACAACTCGC 
13801 TGCTGCAGCC GCAGCGCGAA AAAAACCTGC CTCCGGCATT TCCCAACAAC GGGATAGAGA 
13861 GCCTAGTGGA CAAGATGAGT AGATGGAAGA CGTACGCGCA GGAGCACAGG GACGTGCCAG 
L3921 GCCCGCGCCC GCCCACCCGT CGTCAAAGGC ACGACCGTCA GCGGGGTCTG GTGTGGGAGG 
13981 ACGATGACTC GGCAGACGAC AGCAGCGTCC TGGATTTGGG AGGGAGTGGC AACCCGTTTG 
L4041 CGCACCTTCG CCCCAGGCTG GGGAGAATGT TTTAAAAAAA AAAAAGCATG ATGCAAAATA 
14101 AAAAACTCAC CAAGGCCATG GCACCGAGCG TTGGTTTTCT TGTATTCCCC TTAGTATGCG 
L4161 GCGCGCGGCG ATGTATGAGG AAGGTCCTCC TCCCTCCTAC GAGAGTGTGG TGAGCGCGGC 
1422 1 GCCAGTGGCG GCGGCGCTGG GTTCTCCCTT CGATGCTCCC CTGGACCCGC CGTTTGTGCC 
14281 TCCGCGGTAC CTGCGGCCTA CCGGGGGGAG AAACAGCATC CGTTACTCTG AGTTGGCACC 
14341 CCTATTCGAC ACCACCCGTG TGTACCTGGT GGACAACAAG TCAACGGATG TGGCATCCCT 
14 401 GAACTACCAG AACGACCACA GCAACTTTCT GACCACGGTC ATTCAAAACA ATGACTACAG 
L4461 CCCGGGGGAG GCAAGCACAC AGACCATCAA TCTTGACGAC CGGTCGCACT GGGGCGGCGA 
L4S21 CCTGAAAACC ATCCTGCATA CCAACATGCC AAATGTGAAC GAGTTCATGT TTACCAATAA 
14581 GTTTAAGGCG CGGGTGATGG TGTCGCGCTT GCCTACTAAG GACAATCAGG TGGAGCTGAA 
1.4641 ATACGAGTGG GTGGAGTTCA CGCTGCCCGA GGGCAACTAC TCCGAGACCA TGACCATAGA 
14701 CCTTATGAAC AACGCGATCG TGGAGCACTA CTTGAAAGTG GGCAGACAGA ACGGGGTTCT 
14761 GGAAAGCGAC ATCGGGGTAA AGTTTGACAC CCGCAACTTC AGACTGGGGT TTGACCCCGT 
14821 CACTGGTCTT GTCATGCCTG GGGTATATAC AAACGAAGCC TTCCATCCAG ACATCATTTT 
L4881 GCTGCCAGGA TGCGGGGTGG ACTTCACCCA CAGCCGCCTG AGCAACTTGT TGGGCATCCG 
14941 CAAGCGGCAA CCCTTCCAGG AGGGCTTTAG GATCACCTAC GATGATCTGG AGGGTGGTAA 
15001 CATTCCCGCA CTGTTGGATG TGGACGCCTA CCAGGCGAGC TTGAAAGATG ACACCGAACA 
15061 GGGCGGGGGT GGCGCAGGCG GCAGCAACAG CAGTGGCAGC GGCGCGGAAG AGAACTCCAA 
15121 CGCGGCAGCC GCGGCAATGC AGCCGGTGGA GGACATGAAC GATCATGCCA TTCGCGGCGA 
15181 CACCTTTGCC ACACGGGCTG AGGAGAAGCG CGCTGAGGCC GAAGCAGCGG CCGAAGCTGC 
15241 CGCCCCCGCT GCGCAACCCG AGGTCGAGAA GCCTCAGAAG AAACCGGTGA TCAAACCCCT 
15301 GACAGAGGAC AGCAAGAAAC GCAGTTACAA CCTAATAAGC AATGACAGCA CCTTCACCCA 
15361 GTACCGCAGC TGGTACCTTG CATACAACTA CGGCGACCCT CAGACCGGAA TCCGCTCATG 
15421 GACCCTGCTT TGCACTCCTG ACGTAACCTG CGGCTCGGAG CAGGTCTACT GGTCGTTGCC 
15481 AGACATGATG CAAGACCCCG TGACCTTCCG CTCCACGCGC CAGATCAGCA ACTTTCCGGT 
15541 GGTGGGCGCC GAGCTGTTGC CCGTGCACTC CAAGAGCTTC TACAACGACC AGGCCGTCTA 
15601 CTCCCAACTC ATCCGCCAGT TTACCTCTCT GACCCACGTG TTCAATCGCT TTCCCGAGAA 
15661 CCAGATTTTG GCGCGCCCGC CAGCCCCCAC CATCACCACC GTCAGTGAAA ACGTTCCTGC 
15721 TCTCACAGAT CACGGGACGC TACCGCTGCG CAACAGCATC GGAGGAGTCC AGCGAGTGAC 
15781 CATTACTGAC GCCAGACGCC GCACCTGCCC CTACGTTTAC AAGGCCCTGG GCATAGTCTC 
15841 GCCGCGCGTC CTATCGAGCC GCACTTTTTG AGCAAGCATG TCCATCCTTA TATCGCCCAG 
15901 CAATAACACA GGCTGGGGCC TGCOCTTCCC AAGCAAGATG TTTGGCGGGG CCAAGAAGCG 
15961 CTCCGACCAA CACCCAGTGC GCGTGCGCGG GCACTACCGC GCGCCCTGGG GCGCGCACAA 
16021 ACGCGGCCGC ACTGGGCGCA CCACCGTCGA TGACGCCATC GACGCGGTGG TGGAGGAGGC 
16081 GCGCAACTAC ACGCCCACGC CGCCACCAGT GTCCACAGTG GACGCGGCCA TTCAGACCGT 
16141 GGTGCGCGGA GCCCGGCGCT ATGCTAAAAT GAAGAGACGG CGGAGGCGCG TAGCACGTCG 
16201 CCACCGCCGC CGACCCGGCA CTGCCGCCCA ACGCGCGGCG GCGGCCCTGC TTAACCGCGC 
16261 ACGTCGCACC GGCCGACGGG CGGCCATGCG GGCCGCTCGA AGGCTGGCCG CGGGTATTGT 
16321 CACTGTGCCC CCCAGGTCCA GGCGACGAGC GGCCGCCGCA GCAGCCGCGG CCATTAGTGC 
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16381 TATGACTCAG GGTCGCAGGG GCAACGTGTA TTGGGTGCGC GACTCGGTTA GCGGCCTGCG 
16441 CGTGCCCGTG CGCACCCGCC CCCCGCGCAA CTAGATTGCA AGAAAAAACT ACTTAGACTC 
16501 GTACTGTTGT ATGTATCCAG CGGCGGCGGC GCGCAACGAA GCTATGTCCA AGCGCAAAAT 
16561 CAAAGAAGAG ATGCTCCAGG TCATCGCGCC GGAGATCTAT GGCCCCCCGA AGAAGGAAGA 
16621 GCAGGATTAC AAGCCCCGAA AGCTAAAGCG GGTCAAAAAG AAAAAGAAAG ATGATGATGA 
16681 TGAACTTGAC GACGAGGTGG AACTGCTGCA CGCTACCGCG CCCAGGCGAC GGGTACAGTG 
16741 GAAAGGTCGA CGCGTAAAAC GTGTTTTGCG ACCCGGCACC ACCGTAGTCT TTACGCCCGG 
16801 TGAGCGCTCC ACCCGCACCT ACAAGCGCGT GTATGATGAG GTGTACGGCG ACGAGGACCT 
16861 GCTTGAGCAG GCCAACGAGC GCCTCGGGGA GTTTGCCTAC GGAAAGCGGC ATAAGGACAT 
16921 GCTGGCGTTG CCGCTGGACG AGGGCAACCC AACACCTAGC CTAAAGCCCG TAACACTGCA 
16981 GCAGGTGCTG CCCGCGCTTG CACCGTCCGA AGAAAAGCGC GGCCTAAAGC GCGAGTCTGG 
17041 TGACTTGGCA CCCACCGTGC AGCTGATGGT ACCCAAGCGC CAGCGACTGG AAGATGTCTT 
17101 GGAAAAAATG ACCGTGGAAC CTGGGCTGGA GCCCGAGGTC CGCGTGCGGC CAATCAAGCA 
17161 GGTGGCGCCG GGACTGGGCG TGCAGACCGT GGACGTTCAG ATACCCACTA CCAGTAGCAC 
17221 CAGTATTGCC ACCGCCACAG AGGGCATGGA GACACAAACG TCCCCGGTTG CCTCAGCGGT 
17281 GGCGGATGCC GCGGTGCAGG CGGTCGCTGC GGCCGCGTCC AAGACCTCTA CGGAGGTGCA 
17341 AACGGACCCG TGGATGTTTC GCGTTTCAGC CCCCCGGCGC CCGCGCGGTT CGAGGAAGTA 
17401 CGGCGCCGCC AGCGCGCTAC TGCCCGAATA TGCCCTACAT CCTTCCATTG CGCCTACCCC 
17461 CGGCTATCGT GGCTACACCT ACCGCCCCAG AAGACGAGCA ACTACCCGAC GCCGAACCAC 
17521 CACTGGAACC CGCCGCCGCC GTCGCCGTCG CCAGCCCGTG CTGGCCCCGA TTTCCGTGCG 
17581 CAGGGTGGCT CGCGAAGGAG GCAGGACCCT GGTGCTGCCA ACAGCGCGCT ACCACCCCAG 
17641 CATCGTTTAA AAGCCGGTCT TTGTGGTTCT TGCAGATATG GCCCTCACCT GCCGCCTCCG 
17701 TTTCCCGGTG CCGGGATTCC GAGGAAGAAT GCACCGTAGG AGGGGCATGG CCGGCCACGG 
17761 CCTGACGGGC GGCATGCGTC GTGCGCACCA CCGGCGGCGG CGCGCGTCGC ACCGTCGCAT 
17821 GCGCGGCGGT ATCCTGCCCC TCCTTATTCC ACTGATCGCC GCGGCGATTG GCGCCGTGCC 
17881 CGGAATTGCA TCCGTGGCCT TGCAGGCGCA GAGACACTGA TTAAAAACAA GTTGCATGTG 
17941 GAAAAATCAA AATAAAAAGT CTGGACTCTC ACGCTCGCTT GGTCCTGTAA CTATTTTGTA 
18001 GAATGGAAGA CATCAACTTT GCGTCTCTGG CCCCGCGACA CGGCTCGCGC CCGTTCATGG 
18061 GAAACTGGCA AGATATCGGC ACCAGCAATA TGAGCGGTGG CGCCTTCAGC TGGGGCTCGC 
18121 TGTGGAGCGG CATTAAAAAT TTCGGTTCCA CCGTTAAGAA CTATGGCAGC AAGGCCTGGA 
18181 ACAGCAGCAC AGGCCAGATG CTGAGGGATA AGTTGAAAGA GCAAAATTTC CAACAAAAGG 
18241 TGGTAGATGG CCTGGCCTCT GGCATTAGCG GGGTGGTGGA CCTGGCCAAC CAGGCAGTGC 
18301 AAAATAAGAT TAACAGTAAG CTTGATCCCC GCCCTCCCGT AGAGGAGCCT CCACCGGCCG 
18361 TGGAGACAGT GTCTCCAGAG GGGCGTGGCG AAAAGCGTCC GCGCCCCGAC AGGGAAGAAA 
18421 CTCTGGTGAC GCAAATAGAC GAGCCTCCCT CGTACGAGGA GGCACTAAAG CAAGGCCTGC 
18481 CCACCACCCG TCCCATCGCG CCCATGGCTA CCGGAGTGCT GGGCCAGCAC ACACCCGTAA 
18S41 CGCTGGACCT GCCTCCCCCC GCCGACACCC AGCAGAAACC TGTGCTGCCA GGCCCGACCG 
18601 CCGTTGTTGT AACCCGTCCT AGCCGCGCGT CCCTGCGCCG CGCCGCCAGC GGTCCGCGAT 
18661 CGTTGCGGCC CGTAGCCAGT GGCAACTGGC AAAGCACACT GAACAGCATC GTGGGTCTGG 
18721 GGGTGCAATC CCTGAAGCGC CGACGATGCT TCTGAATAGC TAACGTGTCG TATGTGTGTC 
18781 ATGTATGCGT CCATGTCGCC GCCAGAGGAG CTGCTGAGCC GCCGCGCGCC CGCTTTCCAA 
18841 GATGGCTACC CCTTCGATGA TGCCGCAGTG GTCTTACATG CACATCTCGG GCCAGGACGC 
18901 CTCGGAGTAC CTGAGCCCCG GGCTGGTGCA GTTTGCCCGC GCCACCGAGA CGTACTTCAG 
18961 CCTGAATAAC AAGTTTAGAA ACCCCACGGT GGCGCCTACG CACGACGTGA CCACAGACCG 
19021 GTCCCAGCGT TTGACGCTGC GGTTCATCCC TGTGGACCGT GAGGATACTG CGTACTCGTA 
19081 CAAGGCGCGG TTCACCCTAG CTGTGGGTGA TAACCGTGTG CTGGACATGG CTTCCACGTA 
19141 CTTTGACATC CGCGGCGTGC TGGACAGGGG CCCTACTTTT AAGCCCTACT CTGGCACTGC 
19201 CTACAACGCC CTGGCTCCCA AGGGTGCCCC AAATCCTTGC GAATGGGATG AAGCTGCTAC 
19261 TGCTCTTGAA ATAAACCTAG AAGAAGAGGA CGATGACAAC GAAGACGAAG TAGACGAGCA 
19321 AGCTGAGCAG CAAAAAACTC ACGTATTTGG GCAGGCGCCT TATTCTGGTA TAAATATTAC 
19381 AAAGGAGGGT ATTCAAATAG GTGTCGAAGG TCAAACACCT AAATATGCCG ATAAAACATT 
19441 TCAACCTGAA CCTCAAATAG GAGAATCTCA GTGGTACGAA ACTGAAATTA ATCATGCAGC 
19501 TGGGAGAGTC CTTAAAAAGA CTACCCCAAT GAAACCATGT TACGGTTCAT ATGCAAAACC 
19S61 CACAAATGAA AATGGAGGGC AAGGCATTCT TGTAAAGCAA CAAAATGGAA AGCTAGAAAG 
19621 TCAAGTGGAA ATGCAATTTT TCTCAACTAC TGAGGCGACC GCAGGCAATG GTGATAACTT 
19681 GACTCCTAAA GTGGTATTGT ACAGTGAAGA TGTAGATATA GAAACCCCAG ACACTCATAT 
19741 TTCTTACATG CCCACTATTA AGGAAGGTAA CTCACGAGAA CTAATGGGCC AACAATCTAT 
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19801 GCCCAACAGG CCTAATTACA TTGCTTTTAG GGACAATTTT ATTGGTCTAA TGTATTACAA 
19861 CAGCACGGGT AATATGGGTG TTCTGGCGGG CCAAGCATCG CAGTTGAATG CTGTTGTAGA 
19921 TTTGCAAGAC AGAAACACAG AGCTTTCATA CCAGCTTTTG CTTGATTCCA TTGGTGATAG 
19981 AACCAGGTAC TTTTCTATGT GGAATCAGGC TGTTGACAGC TATGATCCAG ATGTTAGAAT 
20041 TATTGAAAAT CATGGAACTG AAGATGAACT TCCAAATTAC TGCTTTCCAC TGGGAGGTGT 
20101 GATTAATACA GAGACTCTTA CCAAGGTAAA ACCTAAAACA GGTCAGGAAA ATGGATGGGA 
20161 AAAAGATGCT ACAGAATTTT CAGATAAAAA TGAAATAAGA GTTGGAAATA ATTTTGCCAT 
20221 GGAAATCAAT CTAAATGCCA ACCTGTGGAG AAATTTCCTG TACTCCAACA TAGCGCTGTA 
20281 TTTGCCCGAC AAGCTAAAGT ACAGTCCTTC CAACGTAAAA ATTTCTGATA ACCCAAACAC 
20341 CTACGACTAC ATGAACAAGC GAGTGGTGGC TCCCGGGTTA GTGGACTGCT ACATTAACCT 
20401 TGGAGCACGC TGGTCCCTTG ACTATATGGA CAACGTCAAC CCATTTAACC ACCACCGCAA 
20461 TGCTGGCCTG CGCTACCGCT CAATGTTGCT GGGCAATGGT CGCTATGTGC CCTTCCACAT 
20521 CCAGGTGCCT CAGAAGTTCT TTGCCATTAA AAACCTCCTT CTCCTGCCGG GCTCATACAC 
20581 CTACGAGTGG AACTTCAGGA AGGATGTTAA CATGGTTCTG CAGAGCTCCC TAGGAAATGA 
20641 CCTAAGGGTT GACGGAGCCA GCATTAAGTT TGATAGCATT TGCCTTTACG CCACCTTCTT 
20701 CCCCATGGCC CACAACACCG CCTCCACGCT TGAGGCCATG CTTAGAAACG ACACCAACGA 
20761 CCAGTCCTTT AACGACTATC TCTCCGCCGC CAACATGCTC TACCCTATAC CCGCCAACGC 
20821 TACCAACGTG CCCATATCCA TCCCCTCCCG CAACTGGGCG GCTTTCCGCG GCTGGGCCTT 
20881 CACGCGCCTT AAGACTAAGG AAACCCCATC ACTGGGCTCG GGCTACGACC CTTATTACAC 
20941 CTACTCTGGC TCTATACCCT ACCTAGATGG AACCTTTTAC CTCAACCACA CCTTTAAGAA 
21001 GGTGGCCATT ACCTTTGACT CTTCTGTCAG CTGGCCTGGC AATGACCGCC TGCTTACCCC 
21061 CAACGAGTTT GAAATTAAGC GCTCAGTTGA CGGGGAGGGT TACAACGTTG CCCAGTGTAA 
21121 CATGACCAAA GACTGGTTCC TGGTACAAAT GCTAGCTAAC TACAACATTG GCTACCAGGG 
21181 CTTCTATATC CCAGAGAGCT ACAAGGACCG CATGTACTCC TTCTTTAGAA ACTTCCAGCC 
21241 CATGAGCCGT CAGGTGGTGG ATGATACTAA ATACAAGGAC TACCAACAGG TGGGCATCCT 
21301 ACACCAACAC AACAACTCTG GATTTGTTGG CTACCTTGCC CCCACCATGC GCGAAGGACA 
21361 GGCCTACCCT GCTAACTTCC CCTATCCGCT TATAGGCAAG ACCGCAGTTG ACAGCATTAC 
21421 CCAGAAAAAG TTTCTTTGCG ATCGCACCCT TTGGCGCATC CCATTCTCCA GTAACTTTAT 
21481 GTCCATGGGC GCACTCACAG ACCTGGGCCA AAACCTTCTC TACGCCAACT CCGCCCACGC 
21541 GCTAGACATG ACTTTTGAGG TGGATCCCAT GGACGAGCCC ACCCTTCTTT ATGTTTTGTT 
21601 TGAAGTCTTT GACGTGGTCC GTGTGCACCG GCCGCACCGC GGCGTCATCG AAACCGTGTA 
21661 CCTGCGCACG CCCTTCTCGG CCGGCAACGC CACAACATAA AGAAGCAAGC AACATCAACA 
21721 ACAGCTGCCG CCATGGGCTC CAGTGAGCAG GAACTGAAAG CCATTGTCAA AGATCTTGGT 
21781 TGTGGGCCAT ATTTTTTGGG CACCTATGAC AAGCGCTTTC CAGGCTTTGT TTCTCCACAC 
21841 AAGCTCGCCT GCGCCATAGT CAATACGGCC GGTCGCGAGA CTGGGGGCGT ACACTGGATG 
21901 GCCTTTGCCT GGAACCCGCA CTCAAAAACA TGCTACCTCT TTGAGCCCTT TGGCTTTTCT 
21961 GACCAGCGAC TCAAGCAGGT TTACCAGTTT GAGTACGAGT CACTCCTGCG CCGTAGCGCC 
22021 ATTGCTTCTT CCCCCGACCG CTGTATAACG CTGGAAAAGT CCACCCAAAG CGTACAGGGG 
22081 CCCAACTCGG CCGCCTGTGG ACTATTCTGC TGCATGTTTC TCCACGCCTT TGCCAACTGG 
22141 CCCCAAACTC CCATGGATCA CAACCCCACC ATGAACCTTA TTACCGGGGT ACCCAACTCC 
22201 ATGCTCAACA GTCCCCAGGT ACAGCCCACC CTGCGTCGCA ACCAGGAACA GCTCTACAGC 
22261 TTCCTGGAGC GCCACTCGCC CTACTTCCGC AGCCACAGTG CGCAGATTAG GAGCGCCACT 
22321 TCTTTTTGTC ACTTGAAAAA CATGTAAAAA TAATGTACTA GAGACACTTT CAATAAAGGC 
22381 AAATGCTTTT ATTTGTACAC TCTCGGGTGA TTATTTACCC CCACCCTTGC CGTCTGCGCC 
22441 GTTTAAAAAT CAAAGGGGTT CTGCCGCGCA TCGCTATGCG CCACTGGCAG GGACACGTTG 
22501 CGATACTGGT GTTTAGTGCT CCACTTAAAC TCAGGCACAA CCATCCGCGG CAGCTCGGTG 
22561 AAGTTTTCAC TCCACAGGCT GCGCACCATC ACCAACGCGT TTAGCAGGTC GGGCGCCGAT 
22621 ATCTTGAAGT CGCAGTTGGG GCCTCCGCCC TGCGCGCGCG AGTTGCGATA CACAGGGTTG 
22681 CAGCACTGGA ACACTATCAG CGCCGGGTGG TGCACGCTGG CCAGCACGCT CTTGTCGGAG 
22741 ATCAGATCCG CGTCCAGGTC CTCCGCGTTG CTCAGGGCGA ACGGAGTCAA CTTTGGTAGC 
22801 TGCCTTCCCA AAAAGGGCGC GTGCCCAGGC TTTGAGTTGC ACTCGCACCG TAGTGGCATC 
22861 AAAAGGTGAC CGTGCCCGGT CTGGGCGTTA GGATACAGCG CCTGCATAAA AGCCTTGATC 
22921 TGCTTAAAAG CCACCTGAGC CTTTGCGCCT TCAGAGAAGA ACATGCCGCA AGACTTGCCG 
22981 GAAAACTGAT TGGCCGGACA GGCCGCGTCG TGCACGCAGC ACCTTGCGTC GGTGTTGGAG 
23041 ATCTGCACCA CATTTCGGCC CCACCGGTTC TTCACGATCT TGGCCTTGCT AGACTGCTCC 
23101 TTCAGCGCGC GCTGCCCGTT TTCGCTCGTC ACATCCATTT CAATCACGTG CTCCTTATTT 
23161 ATCATAATGC TTCCGTGTAG ACACTTAAGC TCGCCTTCGA TCTCAGCGCA GCGGTGCAGC 
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23221 CACAACGCGC AGCCCGTGGG CTCGTGATGC TTGTAGGTCA CCTCTGCAAA CGACTGCAGG 
23281 TACGCCTGCA GGAATCGCCC CATCATCGTC ACAAAGGTCT TGTTGCTGGT GAAGGTCAGC 
23341 TGCAACCCGC GGTGCTCCTC GTTCAGCCAG GTCTTGCATA CGGCCGCCAG AGCTTCCACT 
23401 TGGTCAGGCA GTAGTTTGAA GTTCGCCTTT AGATCGTTAT CCACGTGGTA CTTGTCCATC 
23461 AGCGCGCGCG CAGCCTCCAT GCCCTTCTCC CACGCAGACA CGATCGGCAC ACTCAGCGGG 
23521 TTCATCACCG TAATTTCACT TTCCGCTTCG CTGGGCTCTT CCTCTTCCTC TTGCGTCCGC 
23581 ATACCACGCG CCACTGGGTC GTCTTCATTC AGCCGCCGCA CTGTGCGCTT ACCTCCTTTG 
23641 CCATGCTTGA TTAGCACCGG TGGGTTGCTG AAACCCACCA TTTGTAGCGC CACATCTTCT 
23701 CTTTCTTCCT CGCTGTCCAC GATTACCTCT GGTGATGGCG GGCGCTCGGG CTTGGGAGAA 
23761 GGGCGCTTCT TTTTCTTCTT GGGCGCAATG GCCAAATCCG CCGCCGAGGT CGATGGCCGC 
23821 GGGCTGGGTG TGCGCGGCAC CAGCGCGTCT TGTGATGAGT CTTCCTCGTC CTCGGACTCG 
23881 ATACGCCGCC TCATCCGCTT TTTTGGGGGC GCCCGGGGAG GCGGCGGCGA CGGGGACGGG 
23941 GACGACACGT CCTCCATGGT TGGGGGACGT CGCGCCGCAC CGCGTCCGCG CTCGGGGGTG 
24001 GTTTCGCGCT GCTCCTCTTC CCGACTGGCC ATTTCCTTCT CCTATAGGCA GAAAAAGATC 
24061 ATGGAGTCAG TCGAGAAGAA GGACAGCCTA ACCGCCCCCT CTGAGTTCGC CACCACCGCC 
24121 TCCACCGATG CCGCCAACGC GCCTACCACC TTCCCCGTCG AGGCACCCCC GCTTGAGGAG 
24181 GAGGAAGTGA TTATCGAGCA GGACCCAGGT TTTGTAAGCG AAGACGACGA GGACCGCTCA 
24241 GTACCAACAG AGGATAAAAA GCAAGACCAG GACAACGCAG AGGCAAACGA GGAACAAGTC 
24301 GGGCGGGGGG ACGAAAGGCA TGGCGACTAC CTAGATGTGG GAGACGACGT GCTGTTGAAG 
24361 CATCTGCAGC GCCAGTGCGC CATTATCTGC GACGCGTTGC AAGAGCGCAG CGATGTGCCC 
24421 CTCGCCATAG CGGATGTCAG CCTTGCCTAC GAACGCCACC TATTCTCACC GCGCGTACCC 
24481 CCCAAACGCC AAGAAAACGG CACATGCGAG CCCAACCCGC GCCTCAACTT CTACCCCGTA 
24541 TTTGCCGTGC CAGAGGTGCT TGCCACCTAT CACATCTTTT TCCAAAACTG CAAGATACCC 
24601 CTATCCTGCC GTGCCAACCG CAGCCGAGCG GACAAGCAGC TGGCCTTGCG GCAGGGCGCT 
24661 GTCATACCTG ATATCGCCTC GCTCAACGAA GTGCCAAAAA TCTTTGAGGG TCTTGGACGC 
24721 GACGAGAAGC GCGCGGCAAA CGCTCTGCAA CAGGAAAACA GCGAAAATGA AAGTCACTCT 
24781 GGAGTGTTGG TGGAACTCGA GGGTGACAAC GCGCGCCTAG CCGTACTAAA ACGCAGCATC 
24841 GAGGTCACCC ACTTTGCCTA CCCGGCACTT AACCTACCCC CCAAGGTCAT GAGCACAGTC 
24901 ATGAGTGAGC TGATCGTGCG CCGTGCGCAG CCCCTGGAGA GGGATGCAAA TTTGCAAGAA 
24961 CAAACAGAGG AGGGCCTACC CGCAGTTGGC GACGAGCAGC TAGCGCGCTG GCTTCAAACG 
25021 CGCGAGCCTG CCGACTTGGA GGAGCGACGC AAACTAATGA TGGCCGCAGT GCTCGTTACC 
25081 GTGGAGCTTG AGTGCATGCA GCGGTTCTTT GCTGACCCGG AGATGCAGCG CAAGCTAGAG 
25141 GAAACATTGC ACTACACCTT TCGACAGGGC TACGTACGCC AGGCCTGCAA GATCTCCAAC 
25201 GTGGAGCTCT GCAACCTGGT CTCCTACCTT GGAATTTTGC ACGAAAACCG CCTTGGGCAA 
25261 AACGTGCTTC ATTCCACGCT CAAGGGCGAG GCGCGCCGCG ACTACGTCCG CGACTGCGTT 
25321 TACTTATTTC TATGCTACAC CTGGCAGACG GCCATGGGCG TTTGGCAGCA GTGCTTGGAG 
2 53 81 GAGTGCAACC TCAAGGAGCT GCAGAAACTG CTAAAGCAAA ACTTGAAGGA CCTATGGACG 
25441 GCCTTCAACG AGCGCTCCGT GGCCGCGCAC CTGGCGGACA TCATTTTCCC CGAACGCCTG 
25501 CTTAAAACCC TGCAACAGGG TCTGCCAGAC TTCACCAGTC AAAGCATGTT GCAGAACTTT 
2S561 AGGAACTTTA TCCTAGAGCG CTCAGGAATC TTGCCCGCCA CCTGCTGTGC ACTTCCTAGC 
25621 GACTTTGTGC CCATTAAGTA CCGCGAATGC CCTCCGCCGC TTTGGGGCCA CTGCTACCTT 
25681 CTGCAGCTAG CCAACTACCT TGCCTACCAC TCTGACATAA TGGAAGACGT GAGCGGTGAC 
25741 GGTCTACTGG AGTGTCACTG TCGCTGCAAC CTATGCACCC CGCACCGCTC CCTGGTTTGC 
25801 AATTCGCAGC TGCTTAACGA AAGTCAAATT ATCGGTACCT TTGAGCTGCA GGGTCCCTCG 
25861 CCTGACGAAA AGTCCGCGGC TCCGGGGTTG AAACTCACTC CGGGGCTGTG GACGTCGGCT 
25921 TACCTTCGCA AATTTGTACC TGAGGACTAC CACGCCCACG AGATTAGGTT CTACGAAGAC 
25981 CAATCCCGCC CGCCAAATGC GGAGCTTACC GCCTGCGTCA TTACCCAGGG CCACATTCTT 
26041 GGCCAATTGC AAGCCATCAA CAAAGCCCGC CAAGAGTTTC TGCTACGAAA GGGACGGGGG 
26101 GTTTACTTGG ACCCCCAGTC CGGCGAGGAG CTCAACCCAA TCCCCCCGCC GCCGCAGCCC 
26161 TATCAGCAGC AGCCGCGGGC CCTTGCTTCC CAGGATGGCA CCCAAAAAGA AGCTGCAGCT 
26221 GCCGCCGCCA CCCACGGACG AGGAGGAATA CTGGGACAGT CAGGCAGAGG AGGTTTTGGA 
26281 CGAGGAGGAG GAGGACATGA TGGAAGACTG GGAGAGCCTA GACGAGGAAG CTTCCGAGGT 
26341 CGAAGAGGTG TCAGACGAAA CACCGTCACC CTCGGTCGCA TTCCCCTCGC CGGCGCCCCA 
26401 GAAATCGGCA ACCGGTTCCA GCATGGCTAC AACCTCCGCT CCTCAGGCGC CGCCGGCACT 
26461 GCCCGTTCGC CGACCCAACC GTAGATGGGA CACCACTGGA ACCAGGGCCG GTAAGTCCAA 
26521 GCAGCCGCCG CCGTTAGCCC AAGAGCAACA ACAGCGCCAA GGCTACCGCT CATGGCGCGG 
26581 GCACAAGAAC GCCATAGTTG CTTGC TT G CA AGACTGTGGG GGCAACATCT CCTTCGCCCG 
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26641 CCGCTTTCTT CTCTACCATC ACGGCGTGGC CTTCCCCCGT AACATCCTGC ATTACTACCG 
26701 TCATCTCTAC AGCCCATACT GCACCGGCGG CAGCGGCAGC GGCAGCAACA GCAGCGGCCA 
26761 CACAGAAGCA AAGGCGACCG GATAGCAAGA CTCTGACAAA GCCCAAGAAA TCCACAGCGG 
26821 CGGCAGCAGC AGGAGGAGGA GCGCTGCGTC TGGCGCCCAA CGAACCCGTA TCGACCCGCG 
26881 AGCTTAGAAA CAGGATTTTT CCCACTCTGT ATGCTATATT TCAACAGAGC AGGGGCCAAG 
26941 AACAAGAGCT GAAAATAAAA AACAGGTCTC TGCGATCCCT CACCCGCAGC TGCCTGTATC 
27001 ACAAAAGCGA AGATCAGCTT CGGCGCACGC TGGAAGACGC GGAGGCTCTC TTCAGTAAAT 
27061 ACTGCGCGCT GACTCTTAAG GACTAGTTTC GCGCCCTTTC TCAAATTTAA GCGCGAAAAC 
27121 TACGTCATCT CCAGCGGCCA CACCCGGCGC CAGCACCTGT CGTCAGCGCC ATTATGAGCA 
27181 AGGAAATTCC CACGCCCTAC ATGTGGAGTT ACCAGCCACA AATGGGACTT GCGGCTGGAG 
27241 CTGCCCAAGA CTACTCAACC CGAATAAACT ACATGAGCGC GGGACCCCAC ATGATATCCC 
27301 GGGTCAACGG AATCCGCGCC CACCGAAACC GAATTCTCTT GGAACAGGCG GCTATTACCA 
27361 CCACACCTCG TAATAACCTT AATCCCCGTA GTTGGCCCGC TGCCCTGGTG TACCAGGAAA 
27421 GTCCCGCTCC CACCACTGTG GTACTTCCCA GAGACGCCCA GGCCGAAGTT CAGATGACTA 
27481 ACTCAGGGGC GCAGCTTGCG GGCGGCTTTC GTCACAGGGT GCGGTCGCCC GGGCAGGGTA 
27541 TAACTCACCT GACAATCAGA GGGCGAGGTA TTCAGCTCAA CGACGAGTCG GTGAGCTCCT 
27601 CGCTTGGTCT CCGTCCGGAC GGGACATTTC AGATCGGCGG CGCCGGCCGT CCTTCATTCA 
27661 CGCCTCGTCA GGCAATCCTA ACTCTGCAGA CCTCGTCCTC TGAGCCGCGC TCTGGAGGCA 
27721 TTGGAACTCT GCAATTTATT GAGGAGTTTG TGCCATCGGT CTACTTTAAC CCCTTCTCGG 
27781 GACCTCCCGG CCACTATCCG GATCAATTTA TTCCTAACTT TGACGCGGTA AAGGACTCGG 
27841 CGGACGGCTA CGACTGAATG TTAAGTGGAG AGGCAGAGCA ACTGCGCCTG AAACACCTGG 
27901 TCCACTGTCG CCGCCACAAG TGCTTTGCCC GCGACTCCGG TGAGTTTTGC TACTTTGAAT 
27961 TGCCCGAGGA TCATATCGAG GGCCCGGCGC ACGGCGTCCG GCTTACCGCC CAGGGAGAGC 
28021 TTGCCCGTAG CCTGATTCGG GAGTTTACCC AGCGCCCCCT GCTAGTTGAG CGGGACAGGG 
28081 GACCCTGTGT TCTCACTGTG ATTTGCAACT GTCCTAACCT TGGATTACAT CAAGATCTTT 
28141 GTTGCCATCT CTGTGCTGAG TATAATAAAT ACAGAAATTA AAATATACTG GGGCTCCTAT 
28201 CGCCATCCTG TAAACGCCAC CGTCTTCACC CGCCCAAGCA AACCAAGGCG AACCTTACCT 
28261 GGTACTTTTA ACATCTCTCC CTCTGTGATT TACAACAGTT TCAACCCAGA CGGAGTGAGT 
28321 CTACGAGAGA ACCTCTCCGA GCTCAGCTAC TCCATCAGAA AAAACACCAC CCTCCTTACC 
28381 TGCCGGGAAC GTACGAGTGC GTCACCGGCC GCTGCACCAC ACCTACCGCC TGACCGTAAA 
28441 CCAGACTTTT TCCGGACAGA CCTCAATAAC TCTGTTTACC AGAACAGGAG GTGAGCTTAG 
28501 AAAACCCTTA GGGTATTAGG CCAAAGGCGC AGCTACTGTG GGGTTTATGA ACAATTCAAG 
28561 CAACTCTACG GGCTATTCTA ATTCAGGTTT CTCTAGAATC GGGGTTGGGG TTATTCTCTG 
28621 TCTTGTGATT CTCTTTATTC TTATACTAAC GCTTCTCTGC CTAAGGCTCG CCGCCTGCTG 
28681 TGTGCACATT TGCATTTATT GTCAGCTTTT TAAACGCTGG GGTCGCCACC CAAGATGATT 
28741 AGGTACATAA TCCTAGGTTT ACTCACCCTT GCGTCAGCCC ACGGTACCAC CCAAAAGGTG 
28801 GATTTTAAGG AGCCAGCCTG TAATGTTACA TTCGCAGCTG AAQCTAATGA GTGCACCACT 
28861 CTTATAAAAT GCACCACAGA ACATGAAAAG CTGCTTATTC GCCACAAAAA CAAAATTGGC 
28921 AAGTATGCTG TTTATGCTAT TTGGCAGCCA GGTGACACTA CAGAGTATAA TGTTACAGTT 
28981 TTCCAGGGTA AAAGTCATAA AACTTTTATG TATACTTTTC CATTTTATGA AATGTGCGAC 
29041 ATTACCATGT ACATGAGCAA ACAGTATAAG TTGTGGCCCC CACAAAATTG TGTGGAAAAC 
29101 ACTGGCACTT TCTGCTGCAC TGCTATGCTA ATTACAGTGC TCGCTTTGGT CTGTACCCTA 
29161 CTCTATATTA AATACAAAAG CAGACGCAGC TTTATTGAGG AAAAGAAAAT GCCTTAATTT 
29221 ACTAAGTTAC AAAGCTAATG TCACCACTAA CTGCTTTACT CGCTGCTTGC AAAACAAATT 
29281 CAAAAAGTTA GCATTATAAT TAGAATAGGA TTTAAACCCC CCGGTCATTT CCTGCTCAAT 
29341 ACCATTCCCC TGAACAATTG ACTCTATGTG GGATATGCTC CAGCGCTACA ACCTTGAAGT 
29401 CAGGCTTCCT GGATGTCAGC ATCTGACTTT GGCCAGCACC TGTCCCGCGG ATTTGTTCCA 
29461 GTCCAACTAC AGCGACCCAC CCTAACAGAG ATGACCAACA CAACCAACGC GGCCGCCGCT 
29521 ACCGGACTTA CATCTACCAC AAATACACCC CAAGTTTCTG CCTTTGTCAA TAACTGGGAT 
29581 AACTTGGGCA TGTGGTGGTT CTCCATAGCG CTTATGTTTG TATGCCTTAT TATTATGTGG 
29641 CTCATCTGCT GCCTAAAGCG CAAACGCGCC CGACCACCCA TCTATAGTCC CATCATTGTG 
29701 CTACACCCAA ACAATGATGG AATCCATAGA TTGGACGGAC TGAAACACAT GTTCTTTTCT 
29761 CTTACAGTAT GATTAAATGA GACATGATTC CTCGAGTTTT TATATTACTG ACCCTTGTTG 
29821 CGCTTTTTTG TGCGTGCTCC ACATTGGCTG CGGTTTCTCA CATCGAAGTA GACTGCATTC 
29881 CAGCCTTCAC AGTCTATTTG CTTTACGGAT TTGTCACCCT CACGCTCATC TGCAGCCTCA 
29941 TCACTGTGGT CATCGCCTTT ATCCAGTGCA TTGACTGGGT CTGTGTGCGC TTTGCATATC 
30001 TCAGACACCA TCCCCAGTAC AGGGACAGGA CTATAGCTGA GCTTCTTAGA ATTCTTTAAT 
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30061 TATGAAATTT ACTGTGACTT TTCTGCTGAT TATTTGCACC CTATCTGCGT TTTGTTCCCC 
30121 GACCTCCAAG CCTCAAAGAC ATATATCATG CAGATTCACT CGTATATGGA ATATTCCAAG 
30181 TTGCTACAAT GAAAAAAGCG ATCTTTCCGA AGCCTGGTTA TATGCAATCA TCTCTGTTAT 
30241 GGTGTTCTGC AGTACCATCT TAGCCCTAGC TATATATCCC TACCTTGACA TTGGCTGGAA 
30301 ACGAATAGAT GCCATGAACC ACCCAACTTT CCCCGCGCCC GCTATGCTTC CACTGCAACA 
30361 AGTTGTTGCC GGCGGCTTTG TCCCAGCCAA TCAGCCTCGC CCCACTTCTC CCACCCCCAC 
30421 TGAAATCAGC TACTTTAATC TAACAGGAGG AGATGACTGA CACCCTAGAT CTAGAAATGG 
30481 ACGGAATTAT TACAGAGCAG CGCCTGCTAG AAAGACGCAG GGCAGCGGCC GAGCAACAGC 
30541 GCATGAATCA AGAGCTCCAA GACATGGTTA ACTTGCACCA GTGCAAAAGG GGTATCTTTT 
30601 GTCTGGTAAA GCAGGCCAAA GTCACCTACG ACAGTAATAC CACCGGACAC CGCCTTAGCT 
30661 ACAAGTTGCC AACCAAGCGT CAGAAATTGG TGGTCATGGT GGGAGAAAAG CCCATTACCA 
30721 TAACTCAGCA CTCGGTAGAA ACCGAAGGCT GCATTCACTC ACCTTGTCAA GGACCTGAGG 
30781 ATCTCTGCAC CCTTATTAAG ACCCTGTGCG GTCTCAAAGA TCTTATTCCC TTTAACTAAT 
30841 AAAAAAAAAT AATAAAGCAT CACTTACTTA AAATCAGTTA GCAAATTTCT GTCCAGTTTA 
30901 TTCAGCAGCA CCTCCTTGCC CTCCTCCCAG CTCTGGTATT GCAGCTTCCT CCTGGCTGCA 
30961 AACTTTCTCC ACAATCTAAA TGGAATGTCA GTTTCCTCCT GTTCCTGTCC ATCCGCACCC 
31021 ACTATCTTCA TGTTGTTGCA GATGAAGCGC GCAAGACCGT CTGAAGATAC CTTCAACCCC 
31081 GTGTATCCAT ATGACACGGA AACCGGTCCT CCAACTGTGC CTTTTCTTAC TCCTCCCTTT 
31141 GTATCCCCCA ATGGGTTTCA AGAGAGTCCC CCTGGGGTAC TCTCTTTGCG CCTATCCGAA 
31201 CCTCTAGTTA CCTCCAATGG CATGCTTGCG CTCAAAATGG GCAACGGCCT CTCTCTGGAC 
31261 GAGGCCGGCA ACCTTACCTC CCAAAATGTA ACCACTGTGA GCCCACCTCT CAAAAAAACC 
31321 AAGTCAAACA TAAACCTGGA AATATCTGCA CCCCTCACAG TTACCTCAGA AGCCCTAACT 
31381 GTGGCTGCCG CCGCACCTCT AATGGTCGCG GGCAACACAC TCACCATGCA ATCACAGGCC 
31441 CCGCTAACCG TGCACGACTC CAAACTTAGC ATTGCCACCC AAGGACCCCT CACAGTGTCA 
3 IS 01 GAAGGAAAGC TAGCCCTGCA AACATCAGGC CCCCTCACCA CCACCGATAG CAGTACCCTT 
31561 ACTATCACTG CCTCACCCCC TCTAACTACT GCCACTGGTA GCTTGGGCAT TGACTTGAAA 
31621 GAGCCCATTT ATACACAAAA TGGAAAACTA GGACTAAAGT ACGGGGCTCC TTTGCATGTA 
31681 ACAGACGACC TAAACACTTT GACCGTAGCA ACTGGTCCAG GTGTGACTAT TAATAATACT 
31741 TCCTTGCAAA CTAAAGTTAC TGGAGCCTTG GGTTTTGATT CACAAGGCAA TATGCAACTT 
31801 AATGTAGCAG GAGGACTAAG GATTGATTCT CAAAACAGAC GCCTTATACT TGATGTTAGT 
31861 TATCCGTTTG ATGCTCAAAA CCAACTAAAT CTAAGACTAG GACAGGGCCC TCTTTTTATA 
31921 AACTCAGCCC ACAACTTGGA TATTAACTAC AACAAAGGCC TTTACTTGTT TACAGCTTCA 
31981 AACAATTCCA AAAAGCTTGA GGTTAACCTA AGCACTGCCA AGGGGTTGAT GTTTGACGCT 
32041 ACAGCCATAG CCATTAATGC AGGAGATGGG CTTGAATTTG GTTCACCTAA TGCACCAAAC 
32101 ACAAATCCCC TCAAAACAAA AATTGGCCAT GGCCTAGAAT TTGATTCAAA CAAGGCTATG 
32161 GTTCCTAAAC TAGGAACTGG CCTTAGTTTT GACAGCACAG GTGCCATTAC AGTAGGAAAC 
32221 AAAAATAATG ATAAGCTAAC TTTGTGGACC ACACCAGCTC CATCTCCTAA CTGTAGACTA 
32281 AATGCAGAGA AAGATGCTAA ACTCACTTTG GTCTTAACAA AATGTGGCAG TCAAATACTT 
32341 GCTACAGTTT CAGTTTTGGC TGTTAAAGGC AGTTTGGCTC CAATATCTGG AACAGTTCAA 
32401 AGTGCTCATC TTATTATAAG ATTTGACGAA AATGGAGTGC TACTAAACAA TTCCTTCCTG 
32461 GACCCAGAAT ATTGGAACTT TAGAAATGGA GATCTTACTG AAGGCACAGC CTATACAAAC 
32521 GCTGTTGGAT TTATGCCTAA CCTATCAGCT TATCCAAAAT CTCACGGTAA AACTGCCAAA 
32581 AGTAACATTG TCAGTCAAGT TTACTTAAAC GGAGACAAAA CTAAACCTGT AACACTAACC 
32641 ATTACACTAA ACGGTACACA GGAAACAGGA GACACAACTC CAAGTGCATA CTCTATGTCA 
32701 TTTTCATGGG ACTGGTCTGG CCACAACTAC ATTAATGAAA TATTTGCCAC ATCCTCTTAC 
32761 ACTTTTTCAT ACATTGCCCA AGAATAAAGA ATCGTTTGTG TTATGTTTCA ACGTGTTTAT 
32821 TTTTCAATTG CAGAAAATTT CAAGTCATTT TTCATTCAGT AGTATAGCCC CACCACCACA 
32881 TAGCTTATAC AGATCACCGT ACCTTAATCA AACTCACAGA ACCCTAGTAT TCAACCTGCC 
32941 ACCTCCCTCC CAACACACAG AGTACACAGT CCTTTCTCCC CGGCTGGCCT TAAAAAGCAT 
33001 CATATCATGG GTAACAGACA TATTCTTAGG TGTTATATTC CACACGGTTT CCTGTCGAGC 
33061 CAAACGCTCA TCAGTGATAT TAATAAACTC CCCGGGCAGC TCACTTAAGT TCATGTCGCT 
33121 GTCCAGCTGC TGAGCCACAG GCTGCTGTCC AACTTGCGGT TGCTTAACGG GCGGCGAAGG 
33181 AGAAGTCCAC GCCTACATGG GGGTAGAGTC ATAATCGTGC ATCAGGATAG GGCGGTGGTG 
33241 CTGCAGCAGC GCGCGAATAA ACTGCTGCCG CCGCCGCTCC GTCCTGCAGG AATACAACAT 
33301 GGCAGTGGTC TCCTCAGCGA TGATTCGCAC CGCCCGCAGC ATAAGGCGCC TTGTCCTCCG 
33361 GGCACAGCAG CGCACCCTGA TCTCACTTAA ATCAGCACAG TAACTGCAGC ACAGCACCAC 
33421 AATATTGTTC AAAATCCCAC AGTGCAAGGC GCTGTATCCA AAGCTCATGG CGGGGACCAC 



FIGURE 21 
(SHEET 10) 

3^6 



Copied from 10206747 on 07-04-2004 



WO 01/04282 



PCT/US00/18971 



334 81 AGAACCCACG TGGCCATCAT ACCACAAGCG CAGGTAGATT AAGTGGCGAC CCCTCATAAA 
33541 CACGCTGGAC ATAAACATTA CCTCTTTTGG CATGTTGTAA TTCACCACCT CCCGGTACCA 
33601 TATAAACCTC TGATTAAACA TGGCGCCATC CACCACCATC CTAAACCAGC TGGCCAAAAC 
33661 CTGCCCGCCG GCTATACACT GCAGGGAACC GGGACTGGAA CAATGACAGT GGAGAGCCCA 
33721 GGACTCGTAA CCATGGATCA TCATGCTCGT CATGATATCA ATGTTGGCAC AACACAGGCA 
33781 CACGTGCATA CACTTCCTCA GGATTACAAG CTCCTCCCGC GTTAGAACCA TATCCCAGGG 
33841 AACAACCCAT TCCTGAATCA GCGTAAATCC CACACTGCAG GGAAGACCTC GCACGTAACT 
33901 CACGTTGTGC ATTGTCAAAG TGTTACATTC GGGCAGCAGC GGATGATCCT CCAGTATGGT 
33961 AGCGCGGGTT TCTGTCTCAA AAGGAGGTAG ACGATCCCTA CTGTACGGAG TGCGCCGAGA 
34021 CAACCGAGAT CGTGTTGGTC GTAGTGTCAT GCCAAATGGA ACGCCGGACG TAGTCATATT 
34081 TCCTGAAGCA AAACCAGGTG CGGGCGTGAC AAACAGATCT GCGTCTCCGG TCTCGCCGCT 
34141 TAGATCGCTC TGTGTAGTAG TTGTAGTATA TCCACTCTCT CAAAGCATCC AGGCGCCCCC 
34201 TGGCTTCGGG TTCTATGTAA ACTCCTTCAT GCGCCGCTGC CCTGATAACA TCCACCACCG 
34261 CAGAATAAGC CACACCCAGC CAACCTACAC ATTCGTTCTG CGAGTCACAC ACGGGAGGAG 
34321 CGGGAAGAGC TGGAAGAACC ATGTTTTTTT TTTTATTCCA AAAGATTATC CAAAACCTCA 
34381 AAATGAAGAT CTATTAAGTG AACGCGCTCC CCTCCGGTGG CGTGGTCAAA CTCTACAGCC 
34441 AAAGAACAGA TAATGGCATT TGTAAGATGT TGCACAATGG CTTCCAAAAG GCAAACGGCC 
34501 CTCACGTCCA AGTGGACGTA AAGGCTAAAC CCTTCAGGGT GAATCTCCTC TATAAACATT 
34561 CCAGCACCTT CAACCATGCC CAAATAATTC TCATCTCGCC ACCTTCTCAA TATATCTCTA 
34621 AGCAAATCCC GAATATTAAG TCCGGCCATT GTAAAAATCT GCTCCAGAGC GCCCTCCACC 
34681 TTCAGCCTCA AGCAGCGAAT CATGATTGCA AAAATTCAGG TTCCTCACAG ACCTGTATAA 
3 4741 GATTCAAAAG CGGAACATTA ACAAAAATAC CGCGATCCCG TAGGTCCCTT CGCAGGGCCA 
34801 GCTGAACATA ATCGTGCAGG TCTGCACGGA CCAGCGCGGC CACTTCCCCG CCAGGAACCT 
34861 TGACAAAAGA ACCCACACTG ATTATGACAC GCATACTCGG AGCTATGCTA ACCAGCGTAG 
34921 CCCCGATGTA AGCTTTGTTG CATGGGCGGC GATATAAAAT GCAAGGTGCT GCTCAAAAAA 
34 981 TCAGGCAAAG CCTCGCGCAA AAAAGAAAGC ACATCGTAGT CATGCTCATG CAGATAAAGG 
35041 CAGGTAAGCT CCGGAACCAC CACAGAAAAA > GACACCATTT TTCTCTCAAA CATGTCTGCG 
35101 GGTTTCTGCA TAAACACAAA ATAAAATAAc' AAAAAAACAT TTAAACATTA GAAGCCTGTC 
35161 TTACAACAGG AAAAACAACC CTTATAAGCA TAAGACGGAC TACGGCCATG CCGGCGTGAC 
35221 CGTAAAAAAA CTGGTCACCG TGATTAAAAA GCACCACCGA CAGCTCCTCG GTCATGTCCG 
35281 GAGTCATAAT GTAAGACTCG GTAAACACAT CAGGTTGATT CATCGGTCAG TGCTAAAAAG 
3 5341 CGACCGAAAT AGCCCGGGGG AATACATACC CGCAGGCGTA GAGACAACAT TACAGCCCCC 
35401 ATAGGAGGTA TAACAAAATT AATAGGAGAG AAAAACACAT AAACACCTGA AAAACCCTCC 
35461 TGCCTAGGCA AAATAGCACC CTCCCGCTCC AGAACAACAT ACAGCGCTTC ACAGCGGCAG 
35521 CCTAACAGTC AGCCTTACCA GTAAAAAAGA AAACCTATTA AAAAAACACC ACTCGACACG 
35581 GCACCAGCTC AATCAGTCAC AGTGTAAAAA AGGGCCAAGT GCAGAGCGAG TATATATAGG 
35641 ACTAAAAAAT GACGTAACGG TTAAAGTCCA CAAAAAACAC CCAGAAAACC GCACGCGAAC 
35701 CTACGCCCAG AAACGAAAGC CAAAAAACCC ACAACTTCCT CAAATCGTCA CTTCCGTTTT 
35761 CCCACGTTAC GTAACTTCCC ATTTTAAGAA AACTACAATT CCCAACACAT ACAAGTTACT 
35821 CCGCCCTAAA ACCTACGTCA CCCGCCCCGT TCCCACGCCC CGCGCCACGT CACAAACTCC 
3S8B1 ACCCCCTCAT TATCATATTG GCTTCAATCC AAAATAAGGT ATATTATTGA TGATG 
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1. .33592 



/gene="KDl" 
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BASE COUNT 7744 a 9470 c 9285 g 7093 t 

ORIGIN 

1 CATCATCAAT AATATACCTT ATTTTGGATT GAAGCCAATA TGATAATGAG GGGGTGGAGT 
61 TTGTGACGTG GCGCGGGGCG TGGGAACGGG GCGGGTGACG TAGTAGTGTG GCGGAAGTGT 
121 GATGTTGCAA GTGTGGCGGA ACACATGTAA GCGACGGATG T GGCA AAAGT GACGTTTTTG 
181 GTGTGCGCCG GTGTACACAG GAAGTGACAA TTTTCGCGCG GTTTTAGGCG GATGTTGTAG 
241 TAAATTTGGG CGTAACCGAG TAAGATTTGG CCATTTTCGC GGGAAAACTG AATAAGAGGA 
301 AGTGAAATCT GAATAATTTT GTGTTACTCA TAGCGCGTAA TATTTGTCTA GGGCCGCGGG 
361 GACTTTGACC GTTTACGTGG AGACTCGCCC AGGTGTTTTT CTCAGGTGTT TTCCGCGTTC 
421 CGGGTCAAAG TTGGCGTTTT ATTATTATAG TCAGCTGACG TGTAGTGTAT TTATACCCGG 
481 TGAGTTCCTC AAGAGGCCAC TCTTGAGTGC CAGCGAGTAG AGTTTTCTCC TCCGAGCCGC 
541 TCCGACACCG GGACTGAAAA TGAGACATGA GGTACTGGCT GATAATCTTC CACCTCCTAG 
601 CCATTTTGAA CCACCTACCC TTCACGAACT GTATGATTTA GACGTGACGG CCCCCGAAGA 
661 TCCCAACGAG GAGGCGGTTT CGCAGATTTT TCCCGACTCT GTAATGTTGG CGGTGCAGGA 
721 AGGGATTGAC TTACTCACTT TTCCGCCGGC GCCCGGTTCT CCGGAGCCGC CTCACCTTTC 
781 CCGGCAGCCC GAGCAGCCGG AGCAGAGAGC CTTGGGTCCG GTTTGCCACG AGGCTGGCTT 
841 TCCACCCAGT GACGACGAGG ATGAAGAGGG TGAGGAGTTT GTGTTAGATT ATGTGGAGCA 
901 CCCCGGGCAC GGTTGCAGGT CTTGTCATTA TCACCGGAGG AATACGGGGG ACCCAGATAT 
961 TATGTGTTCG CTTTGCTATA TGAGGACCTG TGGCATGTTT GTCTACAGTA AG TGAAAA TT 
1021 ATGGGCAGTG GGTGATAGAG TGGTGGGTTT GGTGTGGTAA TTTTTTTTTT AATTTTTACA 
1081 GTTTTGTGGT TTAAAGAATT TTGTATTGTG ATTTTTTTAA AAGGTCCTGT GTCTGAACCT 
1141 GAGCCTGAGC CCGAGCCAGA ACCGGAGCCT GCAAGACCTA CCCGCCGTCC TAAAATGGCG 
1201 CCTGCTATCC TGAGACGCCC GACATCACCT GTGTCTAGAG AATGCAATAG TAGTACGGAT 
1261 AG CTGTG ACT CCGGTCCTTC TAACACACCT CCTGAGATAC ACCCGGTGGT CCCGCTGTGC 
1321 CCCATTAAAC CAGTTGCCGT GAGAGTTGGT GGGCGTCGCC AGGCTGTGGA ATGTATCGAG 
1381 GACTTGCTTA ACGAGCCTGG GCAACCTTTG GACTTGAGCT GTAAACGCCC CAGGCCATAA 
1441 GGTGTAAACC TGTGATTGCG TGTGTGGTTA ACGCCTTTGT TTGCTGAATG AGTTGATGTA 
1501 AGTTTAATAA AGGGTGAGAT AATGTTTAAC TTGCATGGCG TGTTAAATGG GGCGGGGCTT 
1561 AAAGGGTATA TAATGCGCCG TGGGCTAATC TTGGTTACAT CTGACCTCAT GGAGGCTTGG 
1621 GAGTGTTTGG AAGATTTTTC TGCTGTGCGT AACTTGCTGG AACAGAGCTC TAACAGTACC 
1681 TCTTGGTTTT GGAGGTTTCT GTGGGGCTCA TCCCAGGCAA AGTTAGTCTG CAGAATTAAG 
1741 GAGGATTACA AGTGGGAATT TGAAGAGCTT TTGAAATCCT GTGGTGAGCT GTTTGATTCT 
1801 TTGAATCTGG GTCACCAGGC GCTTTTCCAA GAGAAGGTCA TCAAGACTTT GGATTTTTCC 
1861 ACACCGGGGC GCGCTGCGGC TGCTGTTGCT TTTTTGAGTT TTATAAAGGA TAAATGGAGC 
1921 GAAGAAACCC ATCTGAGCGG GGGGTACCTG CTGGATTTTC TGGCCATGCA TCTGTGGAGA 
1981 GCGGTTGTGA GACACAAGAA TCGCCTGCTA CTGTTGTCTT CCGTCCGCCC GGCGATAATA 
2041 CCGACGGAGG AGCAGCAGCA GCAGCAGGAG GAAGCCAGGC GGCGGCGGCA GGAGCAGAGC 
2101 CCATGGAACC CGAGAGCCGG CCTGGACCCT CGGGAATGAA TGTTGTACAG GTGGCTGAAC 
2161 TGTATCCAGA ACTGAGACGC ATTTTGACAA TTACAGAGGA TGGGCAGGGG C TAAAGG GGG 
2221 TAAAGAGGGA GCGGGGGGCT TGTGAGGCTA CAGAGGAGGC TAGGAATCTA GCTTTTAGCT 
2281 TAATGACCAG ACACCGTCCT GAGTGTATTA CTTTTCAACA GATCAAGGAT AATTGCGCTA 
2341 ATGAGCTTGA TCTGCTGGCG CAGAAGTATT CCATAGAGCA GCTGACCACT TACTGGCTGC 
2401 AGCCAGGGGA TGATTTTGAG GAGGCTATTA GGGTATATGC AAAGGTGGCA CTTAGGCCAG 
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2461 ATTGCAAGTA CAAGATCAGC AAACTTGTAA ATATCAGGAA TTGTTGCTAC ATTTCTGGGA 
2521 ACGGGGCCGA GGTGGAGATA GATACGGAGG ATAGGGTGGC CTTTAGATGT AGCATGATAA 
2581 ATATGTGGCC GGGGGTGCTT GGCATGGACG GGGTGGTTAT TATGAATGTA AGGTTTACTG 
2641 GCCCCAATTT TAGCGGTACG GTTTTCCTGG CCAATACCAA CCTTATCCTA CACGGTGTAA 
2701 GCTTCTATGG GTTTAACAAT ACCTGTGTGG AAGCCTGGAC CGATGTAAGG GTTCGGGGCT 
2761 GTGCCTTTTA CTGCTGCTGG AAGGGGGTGG TGTGTCGCCC CAAAAGCAGG GCTTCAATTA 
2821 AGAAATGCCT CTTTGAAAGG TGTACCTTGG GTATCCTGTC TGAGGGTAAC TCCAGGGTGC 
2881 GCCACAATGT GGCCTCCGAC TGTGGTTGCT TCATGCTAGT GAAAAGCGTG GCTGTGATTA 
2941 AGCATAACAT GGTATGTGGC AACTGCGAGG ACAGGGCCTC TCAGATGCTG ACCTGCTCGG 
3001 ACGGCAACTG TCACCTGCTG AAGACCATTC ACGTAGCCAG CCACTCTCGC AAGGCCTGGC 
3061 CAGTGTTTGA GCATAACATA CTGACCCGCT GTTCCTTGCA TTTGGGTAAC AGGAGGGGGG 
3121 TGTTCCTACC TTACCAATGC AATTTGAGTC ACACTAAGAT ATTGCTTGAG CCCGAGAGCA 
3181 TGTCCAAGGT GAACCTGAAC GGGGTGTTTG ACATGACCAT GAAGATCTGG AAGGTGCTGA 
3241 GGTACGATGA GACCCGCACC AGGTGCAGAC CCTGCGAGTG TGGCGGTAAA CATATTAGGA 
3301 ACCAGCCTGT GATGCTGGAT GTGACCGAGG AGCTGAGGCC CGATCACTTG GTGCTGGCCT 
3361 GCACCCGCGC TGAGTTTGGC TCTAGCGATG AAGATACAGA TTGAGGTACT GAAATGTGTG 
3421 GGCGTGGCTT AAGGGTGGGA AAGAATATAT AAGGTGGGGG TCTTATGTAG TTTTGTATCT 
3481 GTTTTGCAGC AGCCGCCGCC GCCATGAGCA CCAACTCGTT TGATGGAAGC ATTGTGAGCT 
3541 CATATTTGAC AACGCGCATG CCCCCATGGG CCGGGGTGCG TCAGAATGTG ATGGGCTCCA 
3601 GCATTGATGG TCGCCCCGTC CTGCCCGCAA ACTCTACTAC CTTGACCTAC GAGACCGTGT 
3661 CTGGAACGCC GTTGGAGACT GCAGCCTCCG CCGCCGCTTC AGCCGCTGCA GCCACCGCCC 
3721 GCGGGATTGT GACTGACTTT GCTTTCCTGA GCCCGCTTGC AAGCAGTGCA GCTTCCCGTT 
3781 CATCCGCCCG CGATGACAAG TTGACGGCTC TTTTGGCACA ATTGGATTCT TTGACCCGGG 
3841 AACTTAATGT CGTTTCTCAG CAGCTGTTGG ATCTGCGCCA GCAGGTTTCT GCCCTGAAGG 
3901 CTTCCTCCCC TCCCAATGCG GTTTAAAACA TAAATAAAAA ACCAGACTCT GTTTGGATTT 
3961 GGATCAAGCA AGTGTCTTGC TGTCTTTATT TAGGGGTTTT GCGCGCGCGG TAGGCCCGGG 
4021 ACCAGCGGTC TCGGTCGTTG AGGGTCCTGT GTATTTTTTC CAGGACGTGG TAAAGGTGAC 
4081 TCTGGATGTT CAGATACATG GGCATAAGCC CGTCTCTGGG GTGGAGGTAG CACCACTGCA 
4141 GAGCTTCATG CTGCGGGGTG GTGTTGTAGA TGATCCAGTC GTAGCAGGAG CGCTGGGCGT 
4201 GGTGCCTAAA AATGTCTTTC AGTAGCAAGC TGATTGCCAG GGGCAGGCCC TTGGTGTAAG 
4261 TGTTTACAAA GCGGTTAAGC TGGGATGGGT GCATACGTGG GGATATGAGA TGCATCTTGG 
4321 ACTGTATTTT TAGGTTGGCT ATGTTCCCAG CCATATCCCT CCGGGGATTC ATGTTGTGCA 
4381 GAACCACCAG CACAGTGTAT CCGGTGCACT TGGGAAATTT GTCATGTAGC TTAGAAGGAA 
4441 ATGCGTGGAA GAACTTGGAG ACGCCCTTGT GACCTCCAAG ATTTTCCATG CATTCGTCCA 
4501 TAATGATGGC AATGGGCCCA CGGGCGGCGG CCTGGGCGAA GATATTTCTG GGATCACTAA 
4561 CGTCATAGTT GTGTTCCAGG ATGAGATCGT CATAGGCCAT TTTTACAAAG CGCGGGCGGA 
4621 GGGTGCCAGA CTGCGGTATA ATGGTTCCAT CCGGCCCAGG GGCGTAGTTA CCCTCACAGA 
4681 TTTGCATTTC CCACGCTTTG AGTTCAGATG GGGGGATCAT GTCTACCTGC GGGGCGATGA 
4741 AGAAAACGGT TTCCGGGGTA GGGGAGATCA GCTGGGAAGA AAGCAGGTTC CTGAGCAGCT 
4801 GCGACTTACC GCAGCCGGTG GGCCCGTAAA TCACACCTAT TACCGGGTGC AACTGGTAGT 
4861 TAAGAGAGCT GCAGCTGCCG TCATCCCTGA GCAGGGGGGC CACTTCGTTA AGCATGTCCC 
4921 TGACTCGCAT GTTTTCCCTG ACCAAATCCG CCAGAAGGCG CTCGCCGCCC AGCGATAGCA 
4981 GTTCTTGCAA GGAAGCAAAG TTTTTCAACG GTTTGAGACC GTCCGCCGTA GGCATGCTTT 
5041 TGAGCGTTTG ACCAAGCAGT TCCAGGCGGT CCCACAGCTC GGTCACCTGC TCTACGGCAT 
5101 CTCGATCCAG CATATCTCCT CGTTTCGCGG GTTGGGGCGG CTTTCGCTGT ACGGCAGTAG 
5161 TCGGTGCTCG TCCAGACGGG CCAGGGTCAT GTCTTTCCAC GGGCGCAGGG TCCTCGTCAG 
5221 CGTAGTCTGG GTCACGGTGA AGGGGTGCGC TCCGGGCTGC GCGCTGGCCA GGGTGCGCTT 
5281 GAGGCTGGTC CTGCTGGTGC TGAAGCGCTG CCGGTCTTCG CCCTGCGCGT CGGCCAGGTA 
5341 GCATTTGACC ATGGTGTCAT AGTCCAGCCC CTCCGCGGCG TGGCCCTTGG CGCGCAGCTT 
5401 GCCCTTGGAG GAGGCGCCGC ACGAGGGGCA GTGCAGACTT TTGAGGGCGT AGAGCTTGGG 
5461 CGCGAGAAAT ACCGATTCCG GGGAGTAGGC ATCCGCGCCG CAGGCCCCGC AGACGGTCTC 
5521 GCATTCCACG AGCCAGGTGA GCTCTGGCCG TTCGGGGTCA AAAACCAGGT TTCCCCCATG 
5581 CTTTTTGATG CGTTTCTTAC CTCTGGTTTC CATGAGCCGG TGTCCACGCT CGGTGACGAA 
5641 AAGGCTGTCC GTGTCCCCGT ATACAGACTT GAGAGGCCTG TCCTCGAGCG GTGTTCCGCG 
5701 GTCCTCCTCG TATAGAAACT CGGACCACTC TGAGACAAAG GCTCGCGTCC AGGCCAGCAC 
5761 GAAGGAGGCT AAGTGGGAGG GGTAGCGGTC GTTGTCCACT AGGGGGTCCA CTCGCTCCAG 
5821 GGTGTGAAGA CACATGTCGC CCTCTTCGGC ATCAAGGAAG GTGATTGGTT TGTAGGTGTA 
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5881 GGCCACGTGA CCGGGTGTTC CTGAAGGGGG GCTATAAAAG GGGGTGGGGG CGCGTTCGTC 
5941 CTCACTCTCT TCCGCATCGC TGTCTGCGAG GGCCAGCTGT TGGGGTGAGT ACTCCCTCTG 
6001 AAAAGCGGGC ATGACTTCTG CGCTAAGATT GTCAGTTTCC AAAAACGAGG AGGATTTGAT 
6061 ATTCACCTGG CCCGCGGTGA TGCCTTTGAG GGTGGCCGCA TCCATCTGGT CAGAAAAGAC 
6121 AATCTTTTTG TTGTCAAGCT TGGTGGCAAA CGACCCGTAG AGGGCGTTGG ACAGCAACTT 
6181 GGCGATGGAG CGCAGGGTTT GGTTTTTGTC GCGATCGGCG CGCTCCTTGG CCGCGATGTT 
6241 TAGCTGCACG TATTCGCGCG CAACGCACCG CCATTCGGGA AAGACGGTGG TGCGCTCGTC 

63 01 GGGCACCAGG TGCACGCGCC AACCGCGGTT GTGCAGGGTG ACAAGGTCAA CGCTGGTGGC 
6361 TACCTCTCCG CGTAGGCGCT CGTTGGTCCA GCAGAGGCGG CCGCCCTTGC GCGAGCAGAA 
6421 TGGCGGTAGG GGGTCTAGCT GCGTCTCGTC CGGGGGGTCT GCGTCCACGG TAAAGACCCC 

64 81 GGGCAGCAGG CGCGCGTCGA AGTAGTCTAT CTTGCATCCT TGCAAGTCTA GCGCCTGCTG 
6541 CCATGCGCGG GCGGCAAGCG CGCGCTCGTA TGGGTTGAGT GGGGGACCCC ATGGCATGGG 
6601 GTGGGTGAGC GCGGAGGCGT ACATGCCGCA AATGTCGTAA ACGTAGAGGG GCTCTCTGAG 
6661 TATTCCAAGA TATGTAGGGT AGCATCTTCC ACCGCGGATG CTGGCGCGCA CGTAATCGTA 
6721 TAGTTCGTGC GAGGGAGCGA GGAGGTCGGG ACCGAGGTTG CTACGGGCGG GCTGCTCTGC 
6781 TCGGAAGACT ATCTGCCTGA AGATGGCATG TGAGTTGGAT GATATGGTTG GACGCTGGAA 
6841 GACGTTGAAG CTGGCGTCTG TGAGACCTAC CGCGTCACGC ACGAAGGAGG CGTAGGAGTC 
6901 GCGCAGCTTG TTGACCAGCT CGGCGGTGAC CTGCACGTCT AGGGCGCAGT AGTCCAGGGT 
6961 TTCCTTGATG ATGTCATACT TATCCTGTCC CTTTTTTTTC CACAGCTCGC GGTTGAGGAC 
7021 AAACTCTTCG CGGTCTTTCC AGTACTCTTG GATCGGAAAC CCGTCGGCCT CCGAACGGTA 
7081 AGAGCCTAGC ATGTAGAACT GGTTGACGGC CTGGTAGGCG CAGCATCCCT TTTCTACGGG 
7141 TAGCGCGTAT GCCTGCGCGG CCTTCCGGAG CGAGGTGTGG GTGAGCGCAA AGGTGTCCCT 
7201 GACCATGACT TTGAGGTACT GGTATTTGAA GTCAGTGTCG TCGCATCCGC CCTGCTCCCA 
7261 GAGCAAAAAG TCCGTGCGCT TTTTGGAACG CGGATTTGGC AGGGCGAAGG TGACATCGTT 
7321 GAAGAGTATC TTTCCCGCGC GAGGCATAAA GTTGCGTGTG ATGCGGAAGG GTCCCGGCAC 
7381 CTCGGAACGG TTGTTAATTA CCTGGGCGGC GAGCACGATC TCGTCAAAGC CGTTGATGTT 
7441 GTGGCCCACA ATGTAAAGTT CCAAGAAGCG CGGGATGCCC TTGATGGAAG GCAATTTTTT 
7501 AAGTTCCTCG TAGGTGAGCT CTTCAGGGGA GCTGAGCCCG TGCTCTGAAA GGGCCCAGTC 
7561 TGCAAGATGA GGGTTGGAAG CGACGAATGA GCTCCACAGG TCACGGGCCA TTAGCATTTG 
7621 CAGGTGGTCG CGAAAGGTCC TAAACTGGCG ACCTATGGCC ATTTTTTCTG GGGTGATGCA 
7681 GTAGAAGGTA AGCGGGTCTT GTTCCCAGCG GTCCCATCCA AGGTTCGCGG CTAGGTCTCG 
7741 CGCGGCAGTC ACTAGAGGCT CATCTCCGCC GAACTTCATG ACCAGCATGA AGGGCACGAG 
7801 CTGCTTCCCA AAGGCCCCCA TCCAAGTATA GGTCTCTACA TCGTAGGTGA CAAAGAGACG 
7861 CTCGGTGCGA GGATGCGAGC CGATCGGGAA GAACTGGATC TCCCGCCACC AATTGGAGGA 
7921 GTGGCTATTG ATGTGGTGAA AGTAGAAGTC CCTGCGACGG GCCGAACACT CGTGCTGGCT 
7981 TTTGTAAAAA CGTGCGCAGT ACTGGCAGCG GTGCACGGGC TGTACATCCT GCACGAGGTT 
B041 GACCTGACGA CCGCGCACAA GGAAGCAGAG TGGGAATTTG AGCCCCTCGC CTGGCGGGTT 
8101 TGGCTGGTGG TCTTCTACTT CGGCTGCTTG TCCTTGACCG TCTGGCTGCT CGAGGGGAGT 
8161 TACGGTGGAT CGGACCACCA CGCCGCGCGA GCCCAAAGTC CAGATGTCCG CGCGCGGCGG 
8221 TCGGAGCTTG ATGACAACAT CGCGCAGATG GGAGCTGTCC ATGGTCTGGA GCTCCCGCGG 
8281 CGTCAGGTCA GGCGGGAGCT CCTGCAGGTT TACCTCGCAT AGACGGGTCA GGGCGCGGGC 
8341 TAGATCCAGG TGATACCTAA TTTCCAGGGG CTGGTTGGTG GCGGCGTCGA TGGCTTGCAA 
8401 GAGGCCGCAT CCCCGCGGCG CGACTACGGT ACCGCGCGGC GGGCGGTGGG CCGCGGGGGT 
8461 GTCCTTGGAT GATGCATCTA AAAGCGGTGA CGCGGGCGAG CCCCCGGAGG TAGGGGGGGC 
8521 TCCGGACCCG CCGGGAGAGG GGGCAGGGGC ACGTCGGCGC CGCGCGCGGG CAGGAGCTGG 
8581 TGCTGCGCGC GTAGGTTGCT GGCGAACGCG ACGACGCGGC GGTTGATCTC CTGAATCTGG 
8641 CGCCTCTGCG TGAAGACGAC GGGCCCGGTG AGCTTGAGCC TGAAAGAGAG TTCGACAGAA 
8701 TCAATTTCGG TGTCGTTGAC GGCGGCCTGG CGCAAAATCT CCTGCACGTC TCCTGAGTTG 
8761 TCTTGATAGG CGATCTCGGC CATGAACTGC TCGATCTCTT CCTCCTGGAG ATCTCCGCGT 
8821 CCGGCTCGCT CCACGGTGGC GGCGAGGTCG TTGGAAATGC GGGCCATGAG CTGCGAGAAG 
8881 GCGTTGAGGC CTCCCTCGTT CCAGACGCGG CTGTAGACCA CGCCCCCTTC GGCATCGCGG 
8941 GCGCGCATGA CCACCTGCGC GAGATTGAGC TCCACGTGCC GGGCGAAGAC GGCGTAGTTT 
9001 CGCAGGCGCT GAAAGAGGTA GTTGAGGGTG GTGGCGGTGT GTTCTGCCAC GAAGAAGTAC 
9061 ATAACCCAGC GTCGCAACGT GGATTCGTTG ATATCCCCCA AGGCCTCAAG GCGCTCCATG 
9121 GCCTCGTAGA AGTCCACGGC GAAGTTGAAA AACTGGGAGT TGCGCGCCGA CACGGTTAAC 
9181 TCCTCCTCCA GAAGACGGAT GAGCTCGGCG ACAGTGTCGC GCACCTCGCG CTCAAAGGCT 
9241 ACAGGGGCCT CTTCTTCTTC TTCAATCTCC TCTTCCATAA GGGCCTCCCC TTCTTCTTCT 
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9301 TCTGGCGGCG GTGGGGGAGG GGGGACACGG CGGCGACGAC GGCGCACCGG GAGGCGGTCG 
9361 ACAAAGCGCT CGATCATCTC CCCGCGGCGA CGGCGCATGG TCTCGGTGAC GGCGCGGCCG 
9421 TTCTCGCGGG GGCGCAGTTG GAAGACGCCG CCCGTCATGT CCCGGTTATG GGTTGGCGGG 
9481 GGGCTGCCAT GCGGCAGGGA TACGGCGCTA ACGATGCATC TCAACAATTG TTGTGTAGGT 
9541 ACTCCGCCGC CGAGGGACCT GAGCGAGTCC GCATCGACCG GATCGGAAAA CCTCTCGAGA 
9601 AAGGCGTCTA ACCAGTCACA GTCGCAAGGT AGGCTGAGCA CCGTGGCGGG CGGCAGCGGG 
9661 CGGCGGTCGG GGTTGTTTCT GGCGGAGGTG CTGCTGATGA TGTAATTAAA GTAGGCGGTC 
9721 TTGAGACGGC GGATGGTCGA CAGAAGCACC ATGTCCTTGG GTCCGGCCTG CTGAATGCGC 
9781 AGGCGGTCGG CCATGCCCCA GGCTTCGTTT TGACATCGGC GCAGGTCTTT GTAGTAGTCT 
9841 TGCATGAGCC TTTCTACCGG CACTTCTTCT TCTCCTTCCT CTTGTCCTGC ATCTCTTGCA 
9901 TCTATCGCTG CGGCGGCGGC GGAGTTTGGC CGTAGGTGGC GCCCTCTTCC TCCCATGCGT 
9961 GTGACCCCGA AGCCCCTCAT CGGCTGAAGC AGGGCTAGGT CGGCGACAAC GCGCTCGGCT 
10021 AATATGGCCT GCTGCACCTG CGTGAGGGTA GACTGGAAGT CATCCATGTC CACAAAGCGG 
10081 TGGTATGCGC CCGTGTTGAT GGTGTAAGTG CAGTTGGCCA TAACGGACCA GTTAACGGTC 
10141 TGGTGACCCG GCTGCGAGAG CTCGGTGTAC CTGAGACGCG AGTAAGCCCT CGAGTCAAAT 
10201 ACGTAGTCGT TGCAAGTCCG CACCAGGTAC TGGTATCCCA CCAAAAAGTG CGGCGGCGGC 
10261 TGGCGGTAGA GGGGCCAGCG TAGGGTGGCC GGGGCTCCGG GGGCGAGATC TTCCAACATA 
10321 AGGCGATGAT ATCCGTAGAT GTACCTGGAC ATCCAGGTGA TGCCGGCGGC GGTGGTGGAG 
10381 GCGCGCGGAA AGTCGCGGAC GCGGTTCCAG ATGTTGCGCA GCGGCAAAAA GTGCTCCATG 
10441 GTCGGGACGC TCTGGCCGGT CAGGCGCGCG CAATCGTTGA CGCTCTAGCG TGCAAAAGGA 
10501 GAGCCTGTAA GCGGGCACTC TTCCGTGGTC TGGTGGATAA ATTCGCAAGG GTATCATGGC 
10561 GGACGACCGG GGTTCGAGCC CCGTATCCGG CCGTCCGCCG TGATCCATGC GGTTACCGCC 
10621 CGCGTGTCGA ACCCAGGTGT GCGACGTCAG ACAACGGGGG AGTGCTCCTT TTGGCTTCCT 
10681 TCCAGGCGCG GCGGCTGCTG CGCTAGCTTT TTTGGCCACT GGCCGCGCGC AGCGTAAGCG 
10741 GTTAGGCTGG AAAGCGAAAG CATTAAGTGG CTCGCTCCCT GTAGCCGGAG GGTTATTTTC 
10801 CAAGGGTTGA GTCGCGGGAC CCCCGGTTCG AGTCTCGGAC CGGCCGGACT GCGGCGAACG 
10861 GGGGTTTGCC TCCCCGTCAT GCAAGACCCC GCTTGCAAAT TCCTCCGGAA ACAGGGACGA 
10921 GCCCCTTTTT TGCTTTTCCC AGATGCATCC GGTGCTGCGG CAGATGCGCC CCCCTCCTCA 
10981 GCAGCGGCAA GAGCAAGAGC AGCGGCAGAC ATGCAGGGCA CCCTCCCCTC CTCCTACCGC 
11041 GTCAGGAGGG GCGACATCCG CGGTTGACGC GGCAGCAGAT GGTGATTACG AACCCCCGCG 
11101 GCGCCGGGCC CGGCACTACC TGGACTTGGA GGAGGGCGAG GGCCTGGCGC GGCTAGGAGC 
11161 GCCCTCTCCT GAGCGGTACC CAAGGGTGCA GCTGAAGCGT GATACGCGTG AGGCGTACGT 
11221 GCCGCGGCAG AACCTGTTTC GCGACCGCGA GGGAGAGGAG CCCGAGGAGA TGCGGGATCG 
11281 AAAGTTCCAC GCAGGGCGCG AGCTGCGGCA TGGCCTGAAT CGCGAGCGGT TGCTGCGCGA 
11341 GGAGGACTTT GAGCCCGACG CGCGAACCGG GATTAGTCCC GCGCGCGCAC ACGTGGCGGC 
11401 CGCCGACCTG GTAACCGCAT ACGAGCAGAC GGTGAACCAG GAGATTAACT TTCAAAAAAG 
11461 CTTTAACAAC CACGTGCGTA CGCTTGTGGC GCGCGAGGAG GTGGCTATAG GACTGATGCA 
11521 TCTGTGGGAC TTTGTAAGCG CGCTGGAGCA AAACCCAAAT AGCAAGCCGC TCATGGCGCA 
11581 GCTGTTCCTT ATAGTGCAGC ACAGCAGGGA CAACGAGGCA TTCAGGGATG CGCTGCTAAA 
11641 CATAGTAGAG CCCGAGGGCC GCTGGCTGCT CGATTTGATA AACATCCTGC AGAGCATAGT 
11701 GGTGCAGGAG CGCAGCTTGA GCCTGGCTGA CAAGGTGGCC GCCATCAACT ATTCCATGCT 
11761 TAGCCTGGGC AAGTTTTACG CCCGCAAGAT ATACCATACC CCTTACGTTC CCATAGACAA 
11821 GGAGGTAAAG ATCGAGGGGT TCTACATGCG CATGGCGCTG AAGGTGCTTA CCTTGAGCGA 
11881 CGACCTGGGC GTTTATCGCA ACGAGCGCAT CCACAAGGCC GTGAGCGTGA GCCGGCGGCG 
11941 CGAGCTCAGC GACCGCGAGC TGATGCACAG CCTGCAAAGG GCCCTGGCTG GCACGGGCAG 
12001 CGGCGATAGA GAGGCCGAGT CCTACTTTGA CGCGGGCGCT GACCTGCGCT GGGCCCCAAG 
12061 CCGACGCGCC CTGGAGGCAG CTGGGGCCGG ACCTGGGCTG GCGGTGGCAC CCGCGCGCGC 
12121 TGGCAACGTC GGCGGCGTGG AGGAATATGA CGAGGACGAT GAGTACGAGC CAGAGGACGG 
12181 CGAGTACTAA GCGGTGATGT TTCTGATCAG ATGATGCAAG ACGCAACGGA CCCGGCGGTG 
12241 CGGGCGGCGC TGCAGAGCCA GCCGTCCGGC CTTAACTCCA CGGACGACTG GCGCCAGGTC 
12301 ATGGACCGCA TCATGTCGCT GACTGCGCGC AATCCTGACG CGTTCCGGCA GCAGCCGCAG 
12361 GCCAACCGGC TCTCCGCAAT TCTGGAAGCG GTGGTCCCGG CGCGCGCAAA CCCCACGCAC 
12421 GAGAAGGTGC TGGCGATCGT AAACGCGCTG GCCGAAAACA GGGCCATCCG GCCCGACGAG 
12481 GCCGGCCTGG TCTACGACGC GCTGCTTCAG CGCGTGGCTC GTTACAACAG CGGCAACGTG 
12541 CAGACCAACC TGGACCGGCT GGTGGGGGAT GTGCGCGAGG CCGTGGCGCA GCGTGAGCGC 
12601 GCGCAGCAGC AGGGCAACCT GGGCTCCATG GTTGCACTAA ACGCCTTCCT GAGTACACAG 
12661 CCCGCCAACG TGCCGCGGGG ACAGGAGGAC TACACCAACT TTGTGAGCGC ACTGCGGCTA 
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12721 ATGGTGACTG AGACACCGCA AAGTGAGGTG TACCAGTCTG GGCCAGACTA TTTTTTCCAG 
12781 ACCAGTAGAC AAGGCCTGCA GACCGTAAAC CTGAGCCAGG CTTTCAAAAA CTTGCAGGGG 
12841 CTGTGGGGGG TGCGGGCTCC CACAGGCGAC CGCGCGACCG TGTCTAGCTT GCTGACGCCC 
12901 AACTCGCGCC TGTTGCTGCT GCTAATAGCG CCCTTCACGG ACAGTGGCAG CGTGTCCCGG 
12961 GACACATACC TAGGTCACTT GCTGACACTG TACCGCGAGG CCATAGGTCA GGCGCATGTG 
13021 GACGAGCATA CTTTCCAGGA GATTACAAGT GTCAGCCGCG CGCTGGGGCA GGAGGACACG 
13081 GGCAGCCTGG AGGCAACCCT AAACTACCTG CTGACCAACC GGCGGCAGAA GATCCCCTCG 
13141 TTGCACAGTT TAAACAGCGA GGAGGAGCGC ATTTTGCGCT ACGTGCAGCA GAGCGTGAGC 
13201 CTTAACCTGA TGCGCGACGG GGTAACGCCC AGCGTGGCGC TGGACATGAC CGCGCGCAAC 
13261 ATGGAACCGG GCATGTATGC CTCAAACCGG CCGTTTATCA ACCGCCTAAT GGACTACTTG 
13321 CATCGCGCGG CCGCCGTGAA CCCCGAGTAT TTCACCAATG CCATCTTGAA CCCGCACTGG 
13381 CTACCGCCCC CTGGTTTCTA CACCGGGGGA TTCGAGGTGC CCGAGGGTAA CGATGGATTC 
13441 CTCTGGGACG ACATAGACGA CAGCGTGTTT TCCCCGCAAC CGCAGACCCT GCTAGAGTTG 
13501 CAACAGCGCG AGCAGGCAGA GGCGGCGCTG CGAAAGGAAA GCTTCCGCAG GCCAAGCAGC 
13561 TTGTCCGATC TAGGCGCTGC GGCCCCGCGG TCAGATGCTA GTAGCCCATT TCCAAGCTTG 
13621 ATAGGGTCTC TTACCAGCAC TCGCACCACC CGCCCGCGCC TGCTGGGCGA GGAGGAGTAC 
13681 CTAAACAACT CGCTGCTGCA GCCGCAGCGC GAAAAAAACC TGCCTCCGGC ATTTCCCAAC 
13741 AACGGGATAG AGAGCCTAGT GGACAAGATG AGTAGATGGA AGACGTACGC GCAGGAGCAC 
13801 AGGGACGTGC CAGGCCCGCG CCCGCCCACC CGTCGTCAAA GGCACGACCG TCAGCGGGGT 
13861 CTGGTGTGGG AGGACGATGA CTCGGCAGAC GACAGCAGCG TCCTGGATTT GGGAGGGAGT 
13921 GGCAACCCGT TTGCGCACCT TCGCCCCAGG CTGGGGAGAA TGTTTTAAAA AAAAAAAAGC 
13981 ATGATGCAAA ATAAAAAACT CACCAAGGCC ATGGCACCGA GCGTTGGTTT TCTTGTATTC 
14041 CCCTTAGTAT GCGGCGCGCG GCGATGTATG AGGAAGGTCC TCCTCCCTCC TACGAGAGTG 
14101 TGGTGAGCGC GGCGCCAGTG GCGGCGGCGC TGGGTTCTCC CTTCGATGCT CCCCTGGACC 
14161 CGCCGTTTGT GCCTCCGCGG TACCTGCGGC CTACCGGGGG GAGAAACAGC ATCCGTTACT 
14221 CTGAGTTGGC ACCCCTATTC GACACCACCC GTGTGTACCT GGTGGACAAC AAGTCAACGG 
14281 ATGTGGCATC CCTGAACTAC CAGAACGACC ACAGCAACTT TCTGACCACG GTCATTCAAA 
14341 ACAATGACTA CAGCCCGGGG GAGGCAAGCA CACAGACCAT CAATCTTGAC GACCGGTCGC 
14401 ACTGGGGCGG CGACCTGAAA ACCATCCTGC ATACCAACAT GCCAAATGTG AACGAGTTCA 
14461 TGTTTACCAA TAAGTTTAAG GCGCGGGTGA TGGTGTCGCG CTTGCCTACT AAGGACAATC 
14521 AGGTGGAGCT GAAATACGAG TGGGTGGAGT TCACGCTGCC CGAGGGCAAC TACTCCGAGA 
14581 CCATGACCAT AGACCTTATG AACAACGCGA TCGTGGAGCA CTACTTGAAA GTGGGCAGAC 
14641 AGAACGGGGT TCTGGAAAGC GACATCGGGG TAAAGTTTGA CACCCGCAAC TTCAGACTGG 
14701 GGTTTGACCC CGTCACTGGT CTTGTCATGC CTGGGGTATA TACAAACGAA GCCTTCCATC 
14761 CAGACATCAT TTTGCTGCCA GGATGCGGGG TGGACTTCAC CCACAGCCGC CTGAGCAACT 
14821 TGTTGGGCAT CCGCAAGCGG CAACCCTTCC AGGAGGGCTT TAGGATCACC TACGATGATC 
14881 TGGAGGGTGG TAACATTCCC GCACTGTTGG ATGTGGACGC CTACCAGGCG AGCTTGAAAG 
14941 ATGACACCGA ACAGGGCGGG GGTGGCGCAG GCGGCAGCAA CAGCAGTGGC AGCGGCGCGG 
15001 AAGAGAACTC CAACGCGGCA GCCGCGGCAA TGCAGCCGGT GGAGGACATG AACGATCATG 
15061 CCATTCGCGG CGACACCTTT GCCACACGGG CTGAGGAGAA GCGCGCTGAG GCCGAAGCAG 
15121 CGGCCGAAGC TGCCGCCCCC GCTGCGCAAC CCGAGGTCGA GAAGCCTCAG AAGAAACCGG 
15181 TGATCAAACC CCTGACAGAG GACAGCAAGA AACGCAGTTA CAACCTAATA AGCAATGACA 
15241 GCACCTTCAC CCAGTACCGC AGCTGGTACC TTGCATACAA CTACGGCGAC CCTCAGACCG 
15301 GAATCCGCTC ATGGACCCTG CTTTGCACTC CTGACGTAAC CTGCGGCTCG GAGCAGGTCT 
15361 ACTGGTCGTT GCCAGACATG ATGCAAGACC CCGTGACCTT CCGCTCCACG CGCCAGATCA 
15421 GCAACTTTCC GGTGGTGGGC GCCGAGCTGT TGCCCGTGCA CTCCAAGAGC TTCTACAACG 
15481 ACCAGGCCGT CTACTCCCAA CTCATCCGCC AGTTTACCTC TCTGACCCAC GTGTTCAATC 
15541 GCTTTCCCGA GAACCAGATT TTGGCGCGCC CGCCAGCCCC CACCATCACC ACCGTCAGTG 
15601 AAAACGTTCC TGCTCTCACA GATCACGGGA CGCTACCGCT GCGCAACAGC ATCGGAGGAG 
15661 TCCAGCGAGT GACCATTACT GACGCCAGAC GCCGCACCTG CCCCTACGTT TACAAGGCCC 
15721 TGGGCATAGT CTCGCCGCGC GTCCTATCGA GCCGCACTTT TTGAGCAAGC ATGTCCATCC 
15781 TTATATCGCC CAGCAATAAC ACAGGCTGGG GCCTGCGCTT CCCAAGCAAG ATGTTTGGCG 
15841 GGGCCAAGAA GCGCTCCGAC CAACACCCAG TGCGCGTGCG CGGGCACTAC CGCGCGCCCT 
15901 GGGGCGCGCA CAAACGCGGC CGCACTGGGC GCACCACCGT CGATGACGCC ATCGACGCGG 
15961 TGGTGGAGGA GGCGCGCAAC TACACGCCCA CGCCGCCACC AGTGTCCACA GTGGACGCGG 
16021 CCATTCAGAC CGTGGTGCGC GGAGCCCGGC GCTATGCTAA AATGAAGAGA CGGCGGAGGC 
16081 GCGTAGCACG TCGCCACCGC CGCCGACCCG GCACTGCCGC CCAACGCGCG GCGGCGGCCC 
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16141 TGCTTAACCG CGCACGTCGC ACCGGCCGAC GGGCGGCCAT GCGGGCCGCT CGAAGGCTGG 
16201 CCGCGGGTAT TGTCACTGTG CCCCCCAGGT CCAGGCGACG AGCGGCCGCC GCAGCAGCCG 
16261 CGGCCATTAG TGCTATGACT CAGGGTCGCA GGGGCAACGT GTATTGGGTG CGCGACTCGG 
16321 TTAGCGGCCT GCGCGTGCCC GTGCGCACCC GCCCCCCGCG CAACTAGATT GCAAGAAAAA 
16381 ACTACTTAGA CTCGTACTGT TGTATGTATC CAGCGGCGGC GGCGCGCAAC GAAGCTATGT 
16441 CCAAGCGCAA AATCAAAGAA GAGATGCTCC AGGTCATCGC GCCGGAGATC TATGGCCCCC 
16501 CGAAGAAGGA AGAGCAGGAT TACAAGCCCC GAAAGCTAAA GCGGGTCAAA AAGAAAAAGA 
16561 AAGATGATGA TGATGAACTT GACGACGAGG TGGAACTGCT GCACGCTACC GCGCCCAGGC 
16621 GACGGGTACA GTGGAAAGGT CGACGCGTAA AACGTGTTTT GCGACCCGGC ACCACCGTAG 
16681 TCTTTACGCC CGGTGAGCGC TCCACCCGCA CCTACAAGCG CGTGTATGAT GAGGTGTACG 
16741 GCGACGAGGA CCTGCTTGAG CAGGCCAACG AGCGCCTCGG GGAGTTTGCC TACGGAAAGC 
16801 GGCATAAGGA CATGCTGGCG TTGCCGCTGG ACGAGGGCAA CCCAACACCT AGCCTAAAGC 
16861 CCGTAACACT GCAGCAGGTG CTGCCCGCGC TTGCACCGTC CGAAGAAAAG CGCGGCCTAA 
16921 AGCGCGAGTC TGGTGACTTG GCACCCACCG TGCAGCTGAT GGTACCCAAG CGCCAGCGAC 
16981 TGGAAGATGT CTTGGAAAAA ATGACCGTGG AACCTGGGCT GGAGCCCGAG GTCCGCGTGC 
17041 GGCCAATCAA GCAGGTGGCG CCGGGACTGG GCGTGCAGAC CGTGGACGTT CAGATACCCA 
17101 CTACCAGTAG CACCAGTATT GCCACCGCCA CAGAGGGCAT GGAGACACAA ACGTCCCCGG 
17161 TTGCCTCAGC GGTGGCGGAT GCCGCGGTGC AGGCGGTCGC TGCGGCCGCG TCCAAGACCT 
17221 CTACGGAGGT GCAAACGGAC CCGTGGATGT TTCGCGTTTC AGCCCCCCGG CGCCCGCGCG 
17281 GTTCGAGGAA GTACGGCGCC GCCAGCGCGC TACTGCCCGA ATATGCCCTA CATCCTTCCA 
17341 TTGCGCCTAC CCCCGGCTAT CGTGGCTACA CCTACCGCCC CAGAAGACGA GCAACTACCC 
17401 GACGCCGAAC CACCACTGGA ACCCGCCGCC GCCGTCGCCG TCGCCAGCCC GTGCTGGCCC 
17461 CGATTTCCGT GCGCAGGGTG GCTCGCGAAG GAGGCAGGAC CCTGGTGCTG CCAACAGCGC 
17521 GCTACCACCC CAGCATCGTT TAAAAGCCGG TCTTTGTGGT TCTTGCAGAT ATGGCCCTCA 
17581 CCTGCCGCCT CCGTTTCCCG GTGCCGGGAT TCCGAGGAAG AATGCACCGT AGGAGGGGCA 
17641 TGGCCGGCCA CGGCCTGACG GGCGGCATGC GTCGTGCGCA CCACCGGCGG CGGCGCGCGT 
17701 CGCACCGTCG CATGCGCGGC GGTATCCTGC CCCTCCTTAT TCCACTGATC GCCGCGGCGA 
17761 TTGGCGCCGT GCCCGGAATT GCATCCGTGG CCTTGCAGGC GCAGAGACAC TGATTAAAAA 
17821 CAAGTTGCAT GTGGAAAAAT CAAAATAAAA AGTCTGGACT CTCACGCTCG CTTGGTCCTG 
17881 TAACTATTTT GTAGAATGGA AGACATCAAC TTTGCGTCTC TGGCCCCGCG ACACGGCTCG 
17941 CGCCCGTTCA TGGGAAACTG GCAAGATATC GGCACCAGCA ATATGAGCGG TGGCGCCTTC 
18001 AGCTGGGGCT CGCTGTGGAG CGGCATTAAA AATTTCGGTT CCACCGTTAA GAACTATGGC 
18061 AGCAAGGCCT GGAACAGCAG CACAGGCCAG ATGCTGAGGG ATAAGTTGAA AGAGCAAAAT 
18121 TTCCAACAAA AGGTGGTAGA TGGCCTGGCC TCTGGCATTA GCGGGGTGGT GGACCTGGCC 
18181 AACCAGGCAG TGCAAAATAA GATTAACAGT AAGCTTGATC CCCGCCCTCC CGTAGAGGAG 
18241 CCTCCACCGG CCGTGGAGAC AGTGTCTCCA GAGGGGCGTG GCGAAAAGCG TCCGCGCCCC 
18301 GACAGGGAAG AAACTCTGGT GACGCAAATA GACGAGCCTC CCTCGTACGA GGAGGCACTA 
18361 AAGCAAGGCC TGCCCACCAC CCGTCCCATC GCGCCCATGG CTACCGGAGT GCTGGGCCAG 
18421 CACACACCCG TAACGCTGGA CCTGCCTCCC CCCGCCGACA CCCAGCAGAA ACCTGTGCTG 
18481 CCAGGCCCGA CCGCCGTTGT TGTAACCCGT CCTAGCCGCG CGTCCCTGCG CCGCGCCGCC 
18541 AGCGGTCCGC GATCGTTGCG GCCCGTAGCC AGTGGCAACT GGCAAAGCAC ACTGAACAGC 
18601 ATCGTGGGTC TGGGGGTGCA ATCCCTGAAG CGCCGACGAT GCTTCTGAAT AGCTAACGTG 
18661 TCGTATGTGT GTCATGTATG CGTCCATGTC GCCGCCAGAG GAGCTGCTGA GCCGCCGCGC 
18721 GCCCGCTTTC CAAGATGGCT ACCCCTTCGA TGATGCCGCA GTGGTCTTAC ATGCACATCT 
18781 CGGGCCAGGA CGCCTCGGAG TACCTGAGCC CCGGGCTGGT GCAGTTTGCC CGCGCCACCG 
18841 AGACGTACTT CAGCCTGAAT AACAAGTTTA GAAACCCCAC GGTGGCGCCT ACGCACGACG 
18901 TGACCACAGA CCGGTCCCAG CGTTTGACGC TGCGGTTCAT CCCTGTGGAC CGTGAGGATA 
18961 CTGCGTACTC GTACAAGGCG CGGTTCACCC TAGCTGTGGG TGATAACCGT GTGCTGGACA 
19021 TGGCTTCCAC GTACTTTGAC ATCCGCGGCG TGCTGGACAG GGGCCCTACT TTTAAGCCCT 
19081 ACTCTGGCAC TGCCTACAAC GCCCTGGCTC CCAAGGGTGC CCCAAATCCT TGCGAATGGG 
19141 ATGAAGCTGC TACTGCTCTT GAAATAAACC TAGAAGAAGA GGACGATGAC AACGAAGACG 
19201 AAGTAGACGA GCAAGCTGAG CAGCAAAAAA CTCACGTATT TGGGCAGGCG CCTTATTCTG 
19261 GTATAAATAT TACAAAGGAG GGTATTCAAA TAGGTGTCGA AGGTCAAACA CCTAAATATG 
19321 CCGATAAAAC ATTTCAACCT GAACCTCAAA TAGGAGAATC TCAGTGGTAC GAAACTGAAA 
19381 TTAATCATGC AGCTGGGAGA GTCCTTAAAA AGACTACCCC AATGAAACCA TGTTACGGTT 
19441 CATATGCAAA ACCCACAAAT GAAAATGGAG GGCAAGGCAT TCTTGTAAAG CAACAAAATG 
19501 GAAAGCTAGA AAGTCAAGTG GAAATGCAAT TTTTCTCAAC TACTGAGGCG ACCGCAGGCA 
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AGACTGGGGG 
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AGTCCACCCA 
TTCTCCACGC 
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CTAGAGACAC 
CCCCCACCCT 
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CAACCATCCG 
CGTTTAGCAG 
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AGAACATGCC 
AGCACCTTGC 
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22981 GCTAGACTGC TCCTTCAGCG CGCGCTGCCC GTTTTCGCTC GTCACATCCA TTTCAATCAC 
23041 GTGCTCCTTA TTTATCATAA TGCTTCCGTG TAGACACTTA AGCTCGCCTT CGATCTCAGC 
23101 GCAGCGGTGC AGCCACAACG CGCAGCCCGT GGGCTCGTGA TGCTTGTAGG TCACCTCTGC 
23161 AAACGACTGC AGGTACGCCT GCAGGAATCG CCCCATCATC GTCACAAAGG TCTTGTTGCT 
23221 GGTGAAGGTC AGCTGCAACC CGCGGTGCTC CTCGTTCAGC CAGGTCTTGC ATACGGCCGC 
232 Bl CAGAGCTTCC ACTTGGTCAG GCAGTAGTTT GAAGTTCGCC TTTAGATCGT TATCCACGTG 
23341 GTACTTGTCC ATCAGCGCGC GCGCAGCCTC CATGCCCTTC TCCCACGCAG ACACGATCGG 
23401 CACACTCAGC GGGTTCATCA CCGTAATTTC ACTTTCCGCT TCGCTGGGCT CTTCCTCTTC 
23461 CTCTTGCGTC CGCATACCAC GCGCCACTGG GTCGTCTTCA TTCAGCCGCC GCACTGTGCG 
23521 CTTACCTCCT TTGCCATGCT TGATTAGCAC CGGTGGGTTG CTGAAACCCA CCATTTGTAG 
23581 CGCCACATCT TCTCTTTCTT CCTCGCTGTC CACGATTACC TCTGGTGATG GCGGGCGCTC 
23641 GGGCTTGGGA GAAGGGCGCT TCTTTTTCTT CTTGGGCGCA ATGGCCAAAT CCGCCGCCGA 
23701 GGTCGATGGC CGCGGGCTGG GTGTGCGCGG CACCAGCGCG TCTTGTGATG AGTCTTCCTC 
23761 GTCCTCGGAC TCGATACGCC GCCTCATCCG CTTTTTTGGG GGCGCCCGGG GAGGCGGCGG 
23 821 CGACGGGGAC GGGGACGACA CGTCCTCCAT GGTTGGGGGA CGTCGCGCCG CACCGCGTCC 
23881 GCGCTCGGGG GTGGTTTCGC GCTGCTCCTC TTCCCGACTG GCCATTTCCT TCTCCTATAG 
23941 GCAGAAAAAG ATCATGGAGT CAGTCGAGAA GAAGGACAGC CTAACCGCCC CCTCTGAGTT 
24001 CGCCACCACC GCCTCCACCG ATGCCGCCAA CGCGCCTACC ACCTTCCCCG TCGAGGCACC 
24061 CCCGCTTGAG GAGGAGGAAG TGATTATCGA GCAGGACCCA GGTTTTGTAA GCGAAGACGA 
24121 CGAGGACCGC TCAGTACCAA CAGAGGATAA AAAGCAAGAC CAGGACAACG CAGAGGCAAA 
24181 CGAGGAACAA GTCGGGCGGG GGGACGAAAG GCATGGCGAC TACCTAGATG TGGGAGACGA 
24241 CGTGCTGTTG AAGCATCTGC AGCGCCAGTG CGCCATTATC TGCGACGCGT TGCAAGAGCG 
24301 CAGCGATGTG CCCCTCGCCA TAGCGGATCT CAGCCTTGCC TACGAACGCC ACCTATTCTC 
24361 ACCGCGCGTA CCCCCCAAAC GCCAAGAAAA CGGCACATGC GAGCCCAACC CGCGCCTCAA 
24421 CTTCTACCCC GTATTTGCCG TGCCAGAGGT GCTTGCCACC TATCACATCT TTTTCCAAAA 
24481 CTGCAAGATA CCCCTATCCT GCCGTGCCAA CCGCAGCCGA GCGGACAAGC AGCTGGCCTT 
24541 GCGGCAGGGC GCTGTCATAC CTGATATCGC CTCGCTCAAC GAAGTGCCAA AAATCTTTGA 
24601 GGGTCTTGGA CGCGACGAGA AGCGCGCGGC AAACGCTCTG CAACAGGAAA ACAGCGAAAA 
24661 TGAAAGTCAC TCTGGAGTGT TGGTGGAACT CGAGGGTGAC AACGCGCGCC TAGCCGTACT 
24721 AAAACGCAGC ATCGAGGTCA CCCACTTTGC CTACCCGGCA CTTAACCTAC CCCCCAAGGT 
24781 CATGAGCACA GTCATGAGTG AGCTGATCGT GCGCCGTGCG CAGCCCCTGG AGAGGGATGC 
24841 AAATTTGCAA GAACAAACAG AGGAGGGCCT ACCCGCAGTT GGCGACGAGC AGCTAGCGCG 
24901 CTGGCTTCAA ACGCGCGAGC CTGCCGACTT GGAGGAGCGA CGCAAACTAA TGATGGCCGC 
24961 AGTGCTCGTT ACCGTGGAGC TTGAGTGCAT GCAGCGGTTC TTTGCTGACC CGGAGATGCA 
25021 GCGCAAGCTA GAGGAAACAT TGCACTACAC CTTTCGACAG GGCTACGTAC GCCAGGCCTG 
25081 CAAGATCTCC AACGTGGAGC TCTGCAACCT GGTCTCCTAC CTTGGAATTT TGCACGAAAA 
25141 CCGCCTTGGG CAAAACGTGC TTCATTCCAC GCTCAAGGGC GAGGCGCGCC GCGACTACGT 
25201 CCGCGACTGC GTTTACTTAT TTCTATGCTA CACCTGGCAG ACGGCCATGG GCGTTTGGCA 
25261 GCAGTGCTTG GAGGAGTGCA ACCTCAAGGA GCTGCAGAAA CTGCTAAAGC AAAACTTGAA 
25321 GGACCTATGG ACGGCCTTCA ACGAGCGCTC CGTGGCCGCG CACCTGGCGG ACATCATTTT 
25381 CCCCGAACGC CTGCTTAAAA CCCTGCAACA GGGTCTGCCA GACTTCACCA GTCAAAGCAT 
25441 GTTGCAGAAC TTTAGGAACT TTATCCTAGA GCGCTCAGGA ATCTTGCCCG CCACCTGCTG 
25501 TGCACTTCCT AGCGACTTTG TGCCCATTAA GTACCGCGAA TGCCCTCCGC CGCTTTGGGG 
25561 CCACTGCTAC CTTCTGCAGC TAGCCAACTA CCTTGCCTAC CACTCTGACA TAATGGAAGA 
25621 CGTGAGCGGT GACGGTCTAC TGGAGTGTCA CTGTCGCTGC AACCTATGCA CCCCGCACCG 
25681 CTCCCTGGTT TGCAATTCGC AGCTGCTTAA CGAAAGTCAA ATTATCGGTA CCTTTGAGCT 
25741 GCAGGGTCCC TCGCCTGACG AAAAGTCCGC GGCTCCGGGG TTGAAACTCA CTCCGGGGCT 
25801 GTGGACGTCG GCTTACCTTC GCAAATTTGT ACCTGAGGAC TACCACGCCC ACGAGATTAG 
25861 GTTCTACGAA GACCAATCCC GCCCGCCAAA TGCGGAGCTT ACCGCCTGCG TCATTACCCA 
25921 GGGCCACATT CTTGGCCAAT TGCAAGCCAT CAACAAAGCC CGCCAAGAGT TTCTGCTACG 
25981 AAAGGGACGG GGGGTTTACT TGGACCCCCA GTCCGGCGAG GAGCTCAACC CAATCCCCCC 
26041 GCCGCCGCAG CCCTATCAGC AGCAGCCGCG GGCCCTTGCT TCCCAGGATG GCACCCAAAA 
26101 AGAAGCTGCA GCTGCCGCCG CCACCCACGG ACGAGGAGGA ATACTGGGAC AGTCAGGCAG 
26161 AGGAGGTTTT GGACGAGGAG GAGGAGGACA TGATGGAAGA CTGGGAGAGC CTAGACGAGG 
26221 AAGCTTCCGA GGTCGAAGAG GTGTCAGACG AAACACCGTC ACCCTCGGTC GCATTCCCCT 
26281 CGCCGGCGCC CCAGAAATCG GCAACCGGTT CCAGCATGGC TACAACCTCC GCTCCTCAGG 
26341 CGCCGCCGGC ACTGCCCGTT CGCCGACCCA ACCGTAGATG GGACACCACT GGAACCAGGG 
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26401 CCGGTAAGTC CAAGCAGCCG CCGCCGTTAG CCCAAGAGCA ACAACAGCGC CAAGGCTACC 
26461 GCTCATGGCG CGGGCACAAG AACGCCATAG TTGCTTGCTT GCAAGACTGT GGGGGCAACA 
26521 TCTCCTTCGC CCGCCGCTTT CTTCTCTACC ATCACGGCGT GGCCTTCCCC CGTAACATCC 
26581 TGCATTACTA CCGTCATCTC TACAGCCCAT ACTGCACCGG CGGCAGCGGC AGCGGCAGCA 
26641 ACAGCAGCGG CCACACAGAA GCAAAGGCGA CCGGATAGCA AGACTCTGAC AAAGCCCAAG 
26701 AAATCCACAG CGGCGGCAGC AGCAGGAGGA GGAGCGCTGC GTCTGGCGCC CAACGAACCC 
26761 GTATCGACCC GCGAGCTTAG AAACAGGATT TTTCCCACTC TGTATGCTAT ATTTCAACAG 
26821 AGCAGGGGCC AAGAACAAGA GCTGAAAATA AAAAACAGGT CTCTGCGATC CCTCACCCGC 
26861 AGCTGCCTGT ATCACAAAAG CGAAGATCAG CTTCGGCGCA CGCTGGAAGA CGCGGAGGCT 
26941 CTCTTCAGTA AATACTGCGC GCTGACTCTT AAGGACTAGT TTCGCGCCCT TTCTCAAATT 
27001 TAAGCGCGAA AACTACGTCA TCTCCAGCGG CCACACCCGG CGCCAGCACC TGTCGTCAGC 
27061 GCCATTATGA GCAAGGAAAT TCCCACGCCC TACATGTGGA GTTACCAGCC ACAAATGGGA 
27121 CTTGCGGCTG GAGCTGCCCA AGACTACTCA ACCCGAATAA ACTACATGAG CGCGGGACCC 
27181 CACATGATAT CCCGGGTCAA CGGAATCCGC GCCCACCGAA ACCGAATTCT CTTGGAACAG 
27241 GCGGCTATTA CCACCACACC TCGTAATAAC CTTAATCCCC GTAGTTGGCC CGCTGCCCTG 
27301 GTGTACCAGG AAAGTCCCGC TCCCACCACT GTGGTACTTC CCAGAGACGC CCAGGCCGAA 
27361 GTTCAGATGA CTAACTCAGG GGCGCAGCTT GCGGGCGGCT TTCGTCACAG GGTGCGGTCG 
27421 CCCGGGCAGG GTATAACTCA CCTGACAATC AGAGGGCGAG GTATTCAGCT CAACGACGAG 
27481 TCGGTGAGCT CCTCGCTTGG TCTCCGTCCG GACGGGACAT TTCAGATCGG CGGCGCCGGC 
27541 CGTCCTTCAT TCACGCCTCG TCAGGCAATC CTAACTCTGC AGACCTCGTC CTCTGAGCCG 
27601 CGCTCTGGAG GCATTGGAAC TCTGCAATTT ATTGAGGAGT TTGTGCCATC GGTCTACTTT 
27661 AACCCCTTCT CGGGACCTCC CGGCCACTAT CCGGATCAAT TTATTCCTAA CTTTGACGCG 
27721 GTAAAGGACT CGGCGGACGG CTACGACTGA TAATTAAGTG GAGAGGCAGA GCAACTGCGC 
27781 CTGAAACACC TGGTCCACTG TCGCCGCCAC AAGTGCTTTG CCCGCGACTC CGGTGAGTTT 
27841 TGCTACTTTG AATTGCCCGA GGATCATATC GAGGATCTTT GTTGCCATCT CTGTGCTGAG 
27901 TATAATAAAT ACAGAAATTA AAATATACTG GGGCTCCTAT CGCCATCCTG TAAACGCCAC 
27961 CGTCTTCACC CGCCCAAGCA AACCAAGGCG AACCTTACCT GGTACTTTTA ACATCTCTCC 
28021 CTCTGTGATT TACAACAGTT TCAACCCAGA CGGAGTGAGT CTACGAGAGA ACCTCTCCGA 
28081 GCTCAGCTAC TCCATCAGAA AAAACACCAC CCTCCTTACC TGCCGGGAAC GTACCCTTAA 
28141 TTAAAAGTCA GGCTTCCTGG ATGTCAGCAT CTGACTTTGG CCAGCACCTG TCCCGCGGAT 
28201 TTGTTCCAGT CCAACTACAG CGACCCACCC TAACAGAGAT GACCAACACA ACCAACGCGG 
28261 CCGCCGCTAC CGGACTTACA TCTACCACAA ATACACCCCA AGTTTCTGCC TTTGTCAATA 
28321 ACTGGGATAA CTTGGGCATG TGGTGGTTCT CCATAGCGCT TATGTTTGTA TGCCTTATTA 
28381 TTATGTGGCT CATCTGCTGC CTAAAGCGCA AACGCGCCCG ACCACCCATC TATAGTCCCA 
28441 TCATTGTGCT ACACCCAAAC AATGATGGAA TCCATAGATT GGACGGACTG AAACACATGT 
28501 TCTTTTCTCT TACAGTATGA TTAAATGAGA TTAATTAAGG AATTTCTGTC CAGTTTATTC 
28561 AGCAGCACCT CCTTGCCCTC CTCCCAGCTC TGGTATTGCA GCTTCCTCCT GGCTGCAAAC 
28621 TTTCTCCACA ATCTAAATGG AATGTCAGTT TCCTCCTGTT CCTGTCCATC CGCACCCACT 
28681 ATCTTCATGT TGTTGCAGAT GAAGCGCGCA AGACCGTCTG AAGATACCTT CAACCCCGTG 
28741 TATCCATATG ACACGGAAAC CGGTCCTCCA ACTGTGCCTT TTCTTACTCC TCCCTTTGTA 
28801 TCCCCCAATG GGTTTCAAGA GAGTCCCCCT GGGGTACTCT CTTTGCGCCT ATCCGAACCT 
28861 CTAGTTACCT CCAATGGCAT GCTTGCGCTC AAAATGGGCA ACGGCCTCTC TCTGGACGAG 
28921 GCCGGCAACC TTACCTCCCA AAATGTAACC ACTGTGAGCC CACCTCTCAA AAAAACCAAG 
28981 TCAAACATAA ACCTGGAAAT ATCTGCACCC CTCACAGTTA CCTCAGAAGC CCTAACTGTG 
29041 GCTGCCGCCG CACCTCTAAT GGTCGCGGGC AACACACTCA CCATGCAATC ACAGGCCCCG 
29101 CTAACCGTGC ACGACTCCAA ACTTAGCATT GCCACCCAAG GACCCCTCAC AGTGTCAGAA 
29161 GGAAAGCTAG CCCTGCAAAC ATCAGGCCCC CTCACCACCA CCGATAGCAG TACCCTTACT 
29221 ATCACTGCCT CACCCCCTCT AACTACTGCC ACTGGTAGCT TGGGCATTGA CTTGAAAGAG 
29281 CCCATTTATA CACAAAATGG AAAACTAGGA CTAAAGTACG GGGCTCCTTT GCATGTAACA 
29341 GACGACCTAA ACACTTTGAC CGTAGCAACT GGTCCAGGTG TGACTATTAA TAATACTTCC 
29401 TTGCAAACTA AAGTTACTGG AGCCTTGGGT TTTGATTCAC AAGGCAATAT GCAACTTAAT 
29461 GTAGCAGGAG GACTAAGGAT TGATTCTCAA AACAGACGCC TTATACTTGA TGTTAGTTAT 
29521 CCGTTTGATG CTCAAAACCA ACTAAATCTA AGACTAGGAC AGGGCCCTCT TTTTATAAAC 
29581 TCAGCCCACA ACTTGGATAT TAACTACAAC AAAGGCCTTT ACTTGTTTAC AGCTTCAAAC 
29641 AATTCCAAAA AGCTTGAGGT TAACCTAAGC ACTGCCAAGG GGTTGATGTT TGACGCTACA 
29701 GCCATAGCCA TTAATGCAGG AGATGGGCTT GAATTTGGTT CACCTAATGC ACCAAACACA 
29761 AATCCCCTCA AAACAAAAAT TGGCCATGGC CTAGAATTTG ATTCAAACAA GGCTATGGTT 
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29821 CCTAAACTAG GAACTGGCCT TAGTTTTGAC AGCACAGGTG CCATTACAGT AGGAAACAAA 

2 9881 AATAATGATA AGCTAACTTT GTGGACCACA CCAGCTCCAT CTCCTAACTG TAGACTAAAT 
29941 GCAGAGAAAG ATGCTAAACT CACTTTGGTC TTAACAAAAT GTGGCAGTCA AATACTTGCT 

3 0001 ACAGTTTCAG TTTTGGCTGT TAAAGGCAGT TTGGCTCCAA TATCTGGAAC AGTTCAAAGT 
3 0061 GCTCATCTTA TTATAAGATT TGACGAAAAT GGAGTGCTAC TAAACAATTC CTTCCTGGAC 
3 0121 CCAGAATATT GGAACTTTAG AAATGGAGAT CTTACTGAAG GCACAGCCTA TACAAACGCT 
30181 GTTGGATTTA TGCCTAACCT ATCAGCTTAT CCAAAATCTC ACGGTAAAAC TGCCAAAAGT 
3 0241 AACATTGTCA GTCAAGTTTA CTTAAACGGA GACAAAACTA AACCTGTAAC ACTAACCATT 
3 0301 ACACTAAACG GTACACAGGA AACAGGAGAC ACAACTCCAA GTGCATACTC TATGTCATTT 
3 0361 TCATGGGACT GGTCTGGCCA CAACTACATT AATGAAATAT TTGCCACATC CTCTTACACT 
3 0421 TTTTCATACA TTGCCCAAGA ATAAAGAATC GTTTGTGTTA TGTTTCAACG TGTTTATTTT 
30481 TCAATTGCAG AAAATTTCAA GTCATTTTTC ATTCAGTAGT ATAGCCCCAC CACCACATAG 
3 0541 CTTATACAGA TCACCGTACC TTAATCAAAC TCACAGAACC CTAGTATTCA ACCTGCCACC 
30601 TCCCTCCCAA CACACAGAGT ACACAGTCCT TTCTCCCCGG CTGGCCTTAA AAAGCATCAT 
30661 ATCATGGGTA ACAGACATAT TCTTAGGTGT TATATTCCAC ACGGTTTCCT GTCGAGCCAA 
30721 ACGCTCATCA GTGATATTAA TAAACTCCCC GGGCAGCTCA CTTAAGTTCA TGTCGCTGTC 
3 0781 CAGCTGCTGA GCCACAGGCT GCTGTCCAAC TTGCGGTTGC TTAACGGGCG GCGAAGGAGA 
30841 AGTCCACGCC TACATGGGGG TAGAGTCATA ATCGTGCATC AGGATAGGGC GGTGGTGCTG 
30901 CAGCAGCGCG CGAATAAACT GCTGCCGCCG CCGCTCCGTC CTGCAGGAAT ACAACATGGC 
30961 AGTGGTCTCC TCAGCGATGA TTCGCACCGC CCGCAGCATA AGGCGCCTTG TCCTCCGGGC 
31021 ACAGCAGCGC ACCCTGATCT CACTTAAATC AGCACAGTAA CTGCAGCACA GCACCACAAT 
31081 ATTGTTCAAA ATCCCACAGT GCAAGGCGCT GTATCCAAAG CTCATGGCGG GGACCACAGA 
31141 ACCCACGTGG CCATCATACC ACAAGCGCAG GTAGATTAAG TGGCGACCCC TCATAAACAC 
31201 GCTGGACATA AACATTACCT CTTTTGGCAT GTTGTAATTC ACCACCTCCC GGTACCATAT 
31261 AAACCTCTGA TTAAACATGG CGCCATCCAC CACCATCCTA AACCAGCTGG CCAAAACCTG 
31321 CCCGCCGGCT ATACACTGCA GGGAACCGGG ACTGGAACAA TGACAGTGGA GAGCCCAGGA 
31381 CTCGTAACCA TGGATCATCA TGCTCGTCAT GATATCAATG TTGGCACAAC ACAGGCACAC 
31441 GTGCATACAC TTCCTCAGGA TTACAAGCTC CTCCCGCGTT AGAACCATAT CCCAGGGAAC 
31501 AACCCATTCC TGAATCAGCG TAAATCCCAC ACTGCAGGGA AGACCTCGCA CGTAACTCAC 
31561 GTTGTGCATT GTCAAAGTGT TACATTCGGG CAGCAGCGGA TGATCCTCCA GTATGGTAGC 
31621 GCGGGTTTCT GTCTCAAAAG GAGGTAGACG ATCCCTACTG TACGGAGTGC GCCGAGACAA 
31681 CCGAGATCGT GTTGGTCGTA GTGTCATGCC AAATGGAACG CCGGACGTAG TCATATTTCC 
31741 TGAAGCAAAA CCAGGTGCGG GCGTGACAAA CAGATCTGCG TCTCCGGTCT CGCCGCTTAG 
31801 ATCGCTCTGT GTAGTAGTTG TAGTATATCC ACTCTCTCAA AGCATCCAGG CGCCCCCTGG 
31861 CTTCGGGTTC TATGTAAACT CCTTCATGCG CCGCTGCCCT GATAACATCC ACCACCGCAG 
31921 AATAAGCCAC ACCCAGCCAA CCTACACATT CGTTCTGCGA GTCACACACG GGAGGAGCGG 
31981 GAAGAGCTGG AAGAACCATG TTTTTTTTTT TATTCCAAAA GATTATCCAA AACCTCAAAA 
32041 TGAAGATCTA TTAAGTGAAC GCGCTCCCCT CCGGTGGCGT GGTCAAACTC TACAGCCAAA 
32101 GAACAGATAA TGGCATTTGT AAGATGTTGC ACAATGGCTT CCAAAAGGCA AACGGCCCTC 
32161 ACGTCCAAGT GGACGTAAAG GCTAAACCCT TCAGGGTGAA TCTCCTCTAT AAACATTCCA 
32221 GCACCTTCAA CCATGCCCAA ATAATTCTCA TCTCGCCACC TTCTCAATAT ATCTCTAAGC 
32281 AAATCCCGAA TATTAAGTCC GGCCATTGTA AAAATCTGCT CCAGAGCGCC CTCCACCTTC 
32341 AGCCTCAAGC AGCGAATCAT GATTGCAAAA ATTCAGGTTC CTCACAGACC TGTATAAGAT 
32401 TCAAAAGCGG AACATTAACA AAAATACCGC GATCCCGTAG GTCCCTTCGC AGGGCCAGCT 
32461 GAACATAATC GTGCAGGTCT GCACGGACCA GCGCGGCCAC TTCCCCGCCA GGAACCTTGA 
32521 CAAAAGAACC CACACTGATT ATGACACGCA TACTCGGAGC TATGCTAACC AGCGTAGCCC 
32581 CGATGTAAGC TTTGTTGCAT GGGCGGCGAT ATAAAATGCA AGGTGCTGCT CAAAAAATCA 
32641 GGCAAAGCCT CGCGCAAAAA AGAAAGCACA TCGTAGTCAT GCTCATGCAG ATAAAGGCAG 
32701 GTAAGCTCCG GAACCACCAC AGAAAAAGAC ACCATTTTTC TCTCAAACAT GTCTGCGGGT 
32761 TTCTGCATAA ACACAAAATA AAATAACAAA AAAACATTTA AACATTAGAA GCCTGTCTTA 
32821 CAACAGGAAA AACAACCCTT ATAAGCATAA GACGGACTAC GGCCATGCCG GCGTGACCGT 
32881 AAAAAAACTG GTCACCGTGA TTAAAAAGCA CCACCGACAG CTCCTCGGTC ATGTCCGGAG 
32941 TCATAATGTA AGACTCGGTA AACACATCAG GTTGATTCAT CGGTCAGTGC TAAAAAGCGA 
33001 CCGAAATAGC CCGGGGGAAT ACATACCCGC AGGCGTAGAG ACAACATTAC AGCCCCCATA 
33061 GGAGGTATAA CAAAATTAAT AGGAGAGAAA AACACATAAA CACCTGAAAA ACCCTCCTGC 
33121 CTAGGCAAAA TAGCACCCTC CCGCTCCAGA ACAACATACA GCGCTTCACA GCGGCAGCCT 
33181 AACAGTCAGC CTTACCAGTA AAAAAGAAAA CCTATTAAAA AAACACCACT CGACACGGCA 
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33241 CCAGCTCAAT CAGTCACAGT GTAAAAAAGG GCCAAGTGCA GAGCGAGTAT ATATAGGACT 
33301 AAAAAATGAC GTAACGGTTA AAGTCCACAA AAAACACCCA GAAAACCGCA CGCGAACCTA 
33361 CGCCCAGAAA CGAAAGCCAA AAAACCCACA ACTTCCTCAA ATCGTCACTT CCGTTTTCCC 
33421 ACGTTACGTA ACTTCCCATT TTAAGAAAAC TACAATTCCC AACACATACA AGTTACTCCG 
33481 CCCTAAAACC TACGTCACCC GCCCCGTTCC CACGCCCCGC GCCACGTCAC AAACTCCACC 
33541 CCCTCATTAT CATATTGGCT TCAATCCAAA ATAAGGTATA TTATTGATGA TG 
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DEFINITION 
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KD3 
KD3 
KD3 



34341 bp 



DNA 



SYN 



06-FEB-1999 



REFERENCE 
AUTHORS 
JOURNAL 

FEATURES 
CDS 



ORGANISM 



Unknown . 

Unknown 

Unclassified. 

1 (bases 1 to 34341) 

Self 

Unpublished . 



Location/Qualifiers 

1. .34341 

/gene="KD3 n 
/products "KD3" 



BASE COUNT 
ORIGIN 



7951 a 9S71 c 9464 g 



7255 t 



1 CATCATCAAT AATATACCTT ATTTTGGATT GAAGCCAATA TGATAATGAG GGGGTGGAGT 
61 TTGTGACGTG GCGCGGGGCG TGGGAACGGG GCGGGTGACG TAGTAGTGTG GCGGAAGTGT 
121 GATGTTGCAA GTGTGGCGGA ACACATGTAA GCGACGGATG TGGCAAAAGT GACGTTTTTG 
181 GTGTGCGCCG GTGTACACAG GAAGTGACAA TTTTCGCGCG GTTTTAGGCG GATGTTGTAG 
241 TAAATTTGGG CGTAACCGAG TAAGATTTGG CCATTTTCGC GGGAAAACTG AATAAGAGGA 
301 AGTGAAATCT GAATAATTTT GTGTTACTCA TAGCGCGTAA TATTTGTCTA GGGCCGCGGG 
361 GACTTTGACC GTTTACGTGG AGACTCGCCC AGGTGTTTTT CTCAGGTGTT TTCCGCGTTC 
421 CGGGTCAAAG TTGGCGTTTT ATTATTATAG TCAGCTGACG TGTAGTGTAT TTATACCCGG 
481 TGAGTTCCTC AAGAGGCCAC TCTTGAGTGC CAGCGAGTAG AGTTTTCTCC TCCGAGCCGC 
541 TCCGACACCG GGACTGAAAA TGAGACATGA GGTACTGGCT GATAATCTTC CACCTCCTAG 
601 CCATTTTGAA CCACCTACCC TTCACGAACT GTATGATTTA GACGTGACGG CCCCCGAAGA 
661 TCCCAACGAG GAGGCGGTTT CGCAGATTTT TCCCGACTCT GTAATGTTGG CGGTGCAGGA 
721 AGGGATTGAC TTACTCACTT TTCCGCCGGC GCCCGGTTCT CCGGAGCCGC CTCACCTTTC 
7B1 CCGGCAGCCC GAGCAGCCGG AGCAGAGAGC CTTGGGTCCG GTTTGCCACG AGGCTGGCTT 
841 TCCACCCAGT GACGACGAGG ATGAAGAGGG TGAGGAGTTT GTGTTAGATT ATGTGGAGCA 
901 CCCCGGGCAC GGTTGCAGGT CTTGTCATTA TCACCGGAGG AATACGGGGG ACCCAGATAT 
961 TATGTGTTCG CTTTGCTATA TGAGGACCTG TGGCATGTTT GTCTACAGTA AGTGAAAATT 
1021 ATGGGCAGTG GGTGATAGAG TGGTGGGTTT GGTGTGGTAA TTTTTTTTTT AATTTTTACA 
1081 GTTTTGTGGT TTAAAGAATT TTGTATTGTG ATTTTTTTAA AAGGTCCTGT GTCTGAACCT 
1141 GAGCCTGAGC CCGAGCCAGA ACCGGAGCCT GCAAGACCTA CCCGCCGTCC TAAAATGGCG 
1201 CCTGCTATCC TGAGACGCCC GACATCACCT GTGTCTAGAG AATGCAATAG TAGTACGGAT 
1261 AGCTGTGACT CCGGTCCTTC TAACACACCT CCTGAGATAC ACCCGGTGGT CCCGCTGTGC 
1321 CCCATTAAAC CAGTTGCCGT GAGAGTTGGT GGGCGTCGCC AGGCTGTGGA ATGTATCGAG 
1381 GACTTGCTTA ACGAGCCTGG GCAACCTTTG GACTTGAGCT GTAAACGCCC CAGGCCATAA 
1441 GGTGTAAACC TGTGATTGCG TGTGTGGTTA ACGCCTTTGT TTGCTGAATG AGTTGATGTA 
1501 AGTTTAATAA AGGGTGAGAT AATGTTTAAC TTGCATGGCG TGTTAAATGG GGCGGGGCTT 
1561 AAAGGGTATA TAATGCGCCG TGGGCTAATC TTGGTTACAT CTGACCTCAT GGAGGCTTGG 
1621 GAGTGTTTGG AAGATTTTTC TGCTGTGCGT AACTTGCTGG AACAGAGCTC TAACAGTACC 
1681 TCTTGGTTTT GGAGGTTTCT GTGGGGCTCA TCCCAGGCAA AGTTAGTCTG CAGAATTAAG 
1741 GAGGATTACA AGTGGGAATT TGAAGAGCTT TTGAAATCCT GTGGTGAGCT GTTTGATTCT 
1801 TTGAATCTGG GTCACCAGGC GCTTTTCCAA GAGAAGGTCA TCAAGACTTT GGATTTTTCC 
1861 ACACCGGGGC GCGCTGCGGC TGCTGTTGCT TTTTTGAGTT TTATAAAGGA TAAATGGAGC 
1921 GAAGAAACCC ATCTGAGCGG GGGGTACCTG CTGGATTTTC TGGCCATGCA TCTGTGGAGA 
1981 GCGGTTGTGA GACACAAGAA TCGCCTGCTA CTGTTGTCTT CCGTCCGCCC GGCGATAATA 
2041 CCGACGGAGG AGCAGCAGCA GCAGCAGGAG GAAGCCAGGC GGCGGCGGCA GGAGCAGAGC 
2101 CCATGGAACC CGAGAGCCGG CCTGGACCCT CGGGAATGAA TGTTGTACAG GTGGCTGAAC 
2161 TGTATCCAGA ACTGAGACGC ATTTTGACAA TTACAGAGGA TGGGCAGGGG CTAAAGGGGG 
2221 TAAAGAGGGA GCGGGGGGCT TGTGAGGCTA CAGAGGAGGC TAGGAATCTA GCTTTTAGCT 
2281 TAATGACCAG ACACCGTCCT GAGTGTATTA CTTTTCAACA GATCAAGGAT AATTGCGCTA 
2341 ATGAGCTTGA TCTGCTGGCG CAGAAGTATT CCATAGAGCA GCTGACCACT TACTGGCTGC 
2401 AGCCAGGGGA TGATTTTGAG GAGGCTATTA GGGTATATGC AAAGGTGGCA CTTAGGCCAG 
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2461 ATTGCAAGTA CAAGATCAGC AAACTTGTAA ATATCAGGAA TTGTTGCTAC ATTTCTGGGA 
2521 ACGGGGCCGA GGTGGAGATA GATACGGAGG ATAGGGTGGC CTTTAGATGT AGCATGATAA 
2581 ATATGTGGCC GGGGGTGCTT GGCATGGACG GGGTGGTTAT TATGAATGTA AGGTTTACTG 
2641 GCCCCAATTT TAGCGGTACG GTTTTCCTGG CCAATACCAA CCTTATCCTA CACGGTGTAA 
2701 GCTTCTATGG GTTTAACAAT ACCTGTGTGG AAGCCTGGAC CGATGTAAGG GTTCGGGGCT 
2761 GTGCCTTTTA CTGCTGCTGG AAGGGGGTGG TGTGTCGCCC CAAAAGCAGG GCTTCAATTA 
2821 AGAAATGCCT CTTTGAAAGG TGTACCTTGG GTATCCTGTC TGAGGGTAAC TCCAGGGTGC 
2881 GCCACAATGT GGCCTCCGAC TGTGGTTGCT TCATGCTAGT GAAAAGCGTG GCTGTGATTA 
2941 AGCATAACAT GGTATGTGGC AACTGCGAGG ACAGGGCCTC TCAGATGCTG ACCTGCTCGG 
3001 ACGGCAACTG TCACCTGCTG AAGACCATTC ACGTAGCCAG CCACTCTCGC AAGGCCTGGC 
3061 CAGTGTTTGA GCATAACATA CTGACCCGCT GTTCCTTGCA TTTGGGTAAC AGGAGGGGGG 
3121 TGTTCCTACC TTACCAATGC AATTTGAGTC ACACTAAGAT ATTGCTTGAG CCCGAGAGCA 
3181 TGTCCAAGGT GAACCTGAAC GGGGTGTTTG ACATGACCAT GAAGATCTGG AAGGTGCTGA 
3241 GGTACGATGA GACCCGCACC AGGTGCAGAC CCTGCGAGTG TGGCGGTAAA CATATTAGGA 
3301 ACCAGCCTGT GATGCTGGAT GTGACCGAGG AGCTGAGGCC CGATCACTTG GTGCTGGCCT 
3361 GCACCCGCGC TGAGTTTGGC TCTAGCGATG AAGATACAGA TTGAGGTACT GAAATGTGTG 
3421 GGCGTGGCTT AAGGGTGGGA AAGAATATAT AAGGTGGGGG TCTTATGTAG TTTTGTATCT 
3481 GTTTTGCAGC AGCCGCCGCC GCCATGAGCA CCAACTCGTT TGATGGAAGC ATTGTGAGCT 
3541 CATATTTGAC AACGCGCATG CCCCCATGGG CCGGGGTGCG TCAGAATGTG ATGGGCTCCA 
3601 GCATTGATGG TCGCCCCGTC CTGCCCGCAA ACTCTACTAC CTTGACCTAC GAGACCGTGT 
3661 CTGGAACGCC GTTGGAGACT GCAGCCTCCG CCGCCGCTTC AGCCGCTGCA GCCACCGCCC 
3721 GCGGGATTGT GACTGACTTT GCTTTCCTGA GCCCGCTTGC AAGCAGTGCA GCTTCCCGTT 
3781 CATCCGCCCG CGATGACAAG TTGACGGCTC TTTTGGCACA ATTGGATTCT TTGACCCGGG 
3841 AACTTAATGT CGTTTCTCAG CAGCTGTTGG ATCTGCGCCA GCAGGTTTCT GCCCTGAAGG 
3901 CTTCCTCCCC TCCCAATGCG GTTTAAAACA TAAATAAAAA ACCAGACTCT GTTTGGATTT 
3961 GGATCAAGCA AGTGTCTTGC TGTCTTTATT TAGGGGTTTT GCGCGCGCGG TAGGCCCGGG 
4021 ACCAGCGGTC TCGGTCGTTG AGGGTCCTGT GTATTTTTTC CAGGACGTGG TAAAGGTGAC 
4081 TCTGGATGTT CAGATACATG GGCATAAGCC CGTCTCTGGG GTGGAGGTAG CACCACTGCA 
4141 GAGCTTCATG CTGCGGGGTG GTGTTGTAGA TGATCCAGTC GTAGCAGGAG CGCTGGGCGT 
4201 GGTGCCTAAA AATGTCTTTC AGTAGCAAGC TGATTGCCAG GGGCAGGCCC TTGGTGTAAG 
4261 TGTTTACAAA GCGGTTAAGC TGGGATGGGT GCATACGTGG GGATATGAGA TGCATCTTGG 
4321 ACTGTATTTT TAGGTTGGCT ATGTTCCCAG CCATATCCCT CCGGGGATTC ATGTTGTGCA 
4381 GAACCACCAG CACAGTGTAT CCGGTGCACT TGGGAAATTT GTCATGTAGC TTAGAAGGAA 
4441 ATGCGTGGAA GAACTTGGAG ACGCCCTTGT GACCTCCAAG ATTTTCCATG CATTCGTCCA 
4501 TAATGATGGC AATGGGCCCA CGGGCGGCGG CCTGGGCGAA GATATTTCTG GGATCACTAA 
4561 CGTCATAGTT GTGTTCCAGG ATGAGATCGT CATAGGCCAT TTTTACAAAG CGCGGGCGGA 
4621 GGGTGCCAGA CTGCGGTATA ATGGTTCCAT CCGGCCCAGG GGCGTAGTTA CCCTCACAGA 
4681 TTTGCATTTC CCACGCTTTG AGTTCAGATG GGGGGATCAT GTCTACCTGC GGGGCGATGA 
4741 AGAAAACGGT TTCCGGGGTA GGGGAGATCA GCTGGGAAGA AAGCAGGTTC CTGAGCAGCT 
4801 GCGACTTACC GCAGCCGGTG GGCCCGTAAA TCACACCTAT TACCGGGTGC AACTGGTAGT 
4861 TAAGAGAGCT GCAGCTGCCG TCATCCCTGA GCAGGGGGGC CACTTCGTTA AGCATGTCCC 
4921 TGACTCGCAT GTTTTCCCTG ACCAAATCCG CCAGAAGGCG CTCGCCGCCC AGCGATAGCA 
4981 GTTCTTGCAA GGAAGCAAAG TTTTTCAACG GTTTGAGACC GTCCGCCGTA GGCATGCTTT 
5041 TGAGCGTTTG ACCAAGCAGT TCCAGGCGGT CCCACAGCTC GGTCACCTGC TCTACGGCAT 
5101 CTCGATCCAG CATATCTCCT CGTTTCGCGG GTTGGGGCGG CTTTCGCTGT ACGGCAGTAG 
5161 TCGGTGCTCG TCCAGACGGG CCAGGGTCAT GTCTTTCCAC GGGCGCAGGG TCCTCGTCAG 
5221 CGTAGTCTGG GTCACGGTGA AGGGGTGCGC TCCGGGCTGC GCGCTGGCCA GGGTGCGCTT 
5281 GAGGCTGGTC CTGCTGGTGC TGAAGCGCTG CCGGTCTTCG CCCTGCGCGT CGGCCAGGTA 
5341 GCATTTGACC ATGGTGTCAT AGTCCAGCCC CTCCGCGGCG TGGCCCTTGG CGCGCAGCTT 
5401 GCCCTTGGAG GAGGCGCCGC ACGAGGGGCA GTGCAGACTT TTGAGGGCGT AGAGCTTGGG 
5461 CGCGAGAAAT ACCGATTCCG GGGAGTAGGC ATCCGCGCCG CAGGCCCCGC AGACGGTCTC 
5521 GCATTCCACG AGCCAGGTGA GCTCTGGCCG TTCGGGGTCA AAAACCAGGT TTCCCCCATG 
5581 CTTTTTGATG CGTTTCTTAC CTCTGGTTTC CATGAGCCGG TGTCCACGCT CGGTGACGAA 
5641 AAGGCTGTCC GTGTCCCCGT ATACAGACTT GAGAGGCCTG TCCTCGAGCG GTGTTCCGCG 
5701 GTCCTCCTCG TATAGAAACT CGGACCACTC TGAGACAAAG GCTCGCGTCC AGGCCAGCAC 
5761 GAAGGAGGCT AAGTGGGAGG GGTAGCGGTC GTTGTCCACT AGGGGGTCCA CTCGCTCCAG 
5821 GGTGTGAAGA CACATGTCGC CCTCTTCGGC ATCAAGGAAG GTGATTGGTT TGTAGGTGTA 
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5881 GGCCACGTGA CCGGGTGTTC CTGAAGGGGG GCTATAAAAG GGGGTGGGGG CGCGTTCGTC 
S941 CTCACTCTCT TCCGCATCGC TGTCTGCGAG GGCCAGCTGT TGGGGTGAGT ACTCCCTCTG 
6001 AAAAGCGGGC ATGACTTCTG CGCTAAGATT GTCAGTTTCC AAAAACGAGG AGGATTTGAT 
6061 ATTCACCTGG CCCGCGGTGA TGCCTTTGAG GGTGGCCGCA TCCATCTGGT CAGAAAAGAC 
6121 AATCTTTTTG TTGTCAAGCT TGGTGGCAAA CGACCCGTAG AGGGCGTTGG ACAGCAACTT 
6181 GGCGATGGAG CGCAGGGTTT GGTTTTTGTC GCGATCGGCG CGCTCCTTGG CCGCGATGTT 
6241 TAGCTGCACG TATTCGCGCG CAACGCACCG CCATTCGGGA AAGACGGTGG TGCGCTCGTC 
6301 GGGCACCAGG TGCACGCGCC AACCGCGGTT GTGCAGGGTG ACAAGGTCAA CGCTGGTGGC 
6361 TACCTCTCCG CGTAGGCGCT CGTTGGTCCA GCAGAGGCGG CCGCCCTTGC GCGAGCAGAA 
6421 TGGCGGTAGG GGGTCTAGCT GCGTCTCGTC CGGGGGGTCT GCGTCCACGG TAAAGACCCC 
6481 GGGCAGCAGG CGCGCGTCGA AGTAGTCTAT CTTGCATCCT TGCAAGTCTA GCGCCTGCTG 
6541 CCATGCGCGG GCGGCAAGCG CGCGCTCGTA TGGGTTGAGT GGGGGACCCC ATGGCATGGG 
6601 GTGGGTGAGC GCGGAGGCGT ACATGCCGCA AATGTCGTAA ACGTAGAGGG GCTCTCTGAG 
6661 TATTCCAAGA TATGTAGGGT AGCATCTTCC ACCGCGGATG CTGGCGCGCA CGTAATCGTA 
6721 TAGTTCGTGC GAGGGAGCGA GGAGGTCGGG ACCGAGGTTG CTACGGGCGG GCTGCTCTGC 
6781 TCGGAAGACT ATCTGCCTGA AGATGGCATG TGAGTTGGAT GATATGGTTG GACGCTGGAA 
6841 GACGTTGAAG CTGGCGTCTG TGAGACCTAC CGCGTCACGC ACGAAGGAGG CGTAGGAGTC 
6901 GCGCAGCTTG TTGACCAGCT CGGCGGTGAC CTGCACGTCT AGGGCGCAGT AGTCCAGGGT 
6961 TTCCTTGATG ATGTCATACT TATCCTGTCC CTTTTTTTTC CACAGCTCGC GGTTGAGGAC 
7021 AAACTCTTCG CGGTCTTTCC AGTACTCTTG GATCGGAAAC CCGTCGGCCT CCGAACGGTA 
7081 AGAGCCTAGC ATGTAGAACT GGTTGACGGC CTGGTAGGCG CAGCATCCCT TTTCTACGGG 
7141 TAGCGCGTAT GCCTGCGCGG CCTTCCGGAG CGAGGTGTGG GTGAGCGCAA AGGTGTCCCT 
7201 GACCATGACT TTGAGGTACT GGTATTTGAA GTCAGTGTCG TCGCATCCGC CCTGCTCCCA 
7261 GAGCAAAAAG TCCGTGCGCT TTTTGGAACG CGGATTTGGC AGGGCGAAGG TGACATCGTT 
7321 GAAGAGTATC TTTCCCGCGC GAGGCATAAA GTTG CGTGTG ATGCGGAAGG GTCCCGGCAC 
7381 CTCGGAACGG TTGTTAATTA CCTGGGCGGC GAGCACGATC TCGTCAAAGC CGTTGATGTT 
7441 GTGGCCCACA ATGTAAAGTT CCAAGAAGCG CGGGATGCCC TTGATGGAAG GCAATTTTTT 
7501 AAGTTCCTCG TAGGTGAGCT CTTCAGGGGA GCTGAGCCCG TGCTCTGAAA GGGCCCAGTC 
7561 TGCAAGATGA GGGTTGGAAG CGACGAATGA GCTCCACAGG TCACGGGCCA TTAGCATTTG 
7621 CAGGTGGTCG CGAAAGGTCC TAAACTGGCG ACCTATGGCC ATTTTTTCTG GGGTGATGCA 
7681 GTAGAAGGTA AGCGGGTCTT GTTCCCAGCG GTCCCATCCA AGGTTCGCGG CTAGGTCTCG 
7741 CGCGGCAGTC ACTAGAGGCT CATCTCCGCC GAACTTCATG ACCAGCATGA AGGGCACGAG 
7801 CTGCTTCCCA AAGGCCCCCA TCCAAGTATA GGTCTCTACA TCGTAGGTGA CAAAGAGACG 
7861 CTCGGTGCGA GGATGCGAGC CGATCGGGAA GAACTGGATC TCCCGCCACC AATTGGAGGA 
7921 GTGGCTATTG ATGTGGTGAA AGTAGAAGTC CCTGCGACGG GCCGAACACT CGTGCTGGCT 
7981 TTTGTAAAAA CGTGCGCAGT ACTGGCAGCG GTGCACGGGC TGTACATCCT GCACGAGGTT 
8041 GACCTGACGA CCGCGCACAA GGAAGCAGAG TGGGAATTTG AGCCCCTCGC CTGGCGGGTT 
8101 TGGCTGGTGG TCTTCTACTT CGGCTGCTTG TCCTTGACCG TCTGGCTGCT CGAGGGGAGT 
8161 TACGGTGGAT CGGACCACCA CGCCGCGCGA GCCCAAAGTC CAGATGTCCG CGCGCGGCGG 
8221 TCGGAGCTTG ATGACAACAT CGCGCAGATG GGAGCTGTCC ATGGTCTGGA GCTCCCGCGG 
8281 CGTCAGGTCA GGCGGGAGCT CCTGCAGGTT TACCTCGCAT AGACGGGTCA GGGCGCGGGC 
8341 TAGATCCAGG TGATACCTAA TTTCCAGGGG CTGGTTGGTG GCGGCGTCGA TGGCTTGCAA 
8401 GAGGCCGCAT CCCCGCGGCG CGACTACGGT ACCGCGCGGC GGGCGGTGGG CCGCGGGGGT 
8461 GTCCTTGGAT GATGCATCTA AAAGCGGTGA CGCGGGCGAG CCCCCGGAGG TAGGGGGGGC 
8521 TCCGGACCCG CCGGGAGAGG GGGCAGGGGC ACGTCGGCGC CGCGCGCGGG CAGGAGCTGG 
8581 TGCTGCGCGC GTAGGTTGCT GGCGAACGCG ACGACGCGGC GGTTGATCTC CTGAATCTGG 
8641 CGCCTCTGCG TGAAGACX3AC GGGCCCGGTG AGCTTGAGCC TGAAAGAGAG TTCGACAGAA 
8701 TCAATTTCGG TGTCGTTGAC GGCGGCCTGG CGCAAAATCT CCTGCACGTC TCCTGAGTTG 
8761 TCTTGATAGG CGATCTCGGC CATGAACTGC TCGATCTCTT CCTCCTGGAG ATCTCCGCGT 
8821 CCGGCTCGCT CCACGGTGGC GGCGAGGTCG TTGGAAATGC GGGCCATGAG CTGCGAGAAG 
8881 GCOTTGAGGC CTCCCTCGTT CCAGACGCGG CTGTAGACCA CGCCCCCTTC GGCATCGCGG 
8941 GCGCGCATGA CCACCTGCGC GAGATTGAGC TCCACGTGCC GGGCGAAGAC GGCGTAGTTT 
9001 CGCAGGCGCT GAAAGAGGTA GTTGAGGGTG GTGGCGGTGT GTTCTGCCAC GAAGAAGTAC 
9061 ATAACCCAGC GTCGCAACGT GGATTCGTTG ATATCCCCCA AGGCCTCAAG GCGCTCCATG 
9121 GCCTCGTAGA AGTCCACGGC GAAGTTGAAA AACTGGGAGT TGCGCGCCGA CACGGTTAAC 
9181 TCCTCCTCCA GAAGACGGAT GAGCTCGGCG ACAGTGTCGC GCACCTCGCG CTCAAAGGCT 
9241 ACAGGGGCCT CTTCTTCTTC TTCAATCTCC TCTTCCATAA GGGCCTCCCC TTCTTCTTCT 
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9301 TCTGGCGGCG GTGGGGGAGG GGGGACACGG CGGCGACGAC GGCGCACCGG GAGGCGGTCG 
9361 ACAAAGCGCT CGATCATCTC CCCGCGGCGA CGGCGCATGG TCTCGGTGAC GGCGCGGCCG 
9421 TTCTCGCGGG GGCGCAGTTG GAAGACGCCG CCCGTCATGT CCCGGTTATG GGTTGGCGGG 
9481 GGGCTGCCAT GCGGCAGGGA TACGGCGCTA ACGATGCATC TCAACAATTG TTGTGTAGGT 
9541 ACTCCGCCGC CGAGGGACCT GAGCGAGTCC GCATCGACCG GATCGGAAAA CCTCTCGAGA 
9601 AAGGCGTCTA ACCAGTCACA GTCGCAAGGT AGGCTGAGCA CCGTGGCGGG CGGCAGCGGG 
9661 CGGCGGTCGG GGTTGTTTCT GGCGGAGGTG CTGCTGATGA TGTAATTAAA GTAGGCGGTC 
9721 TTGAGACGGC GGATGGTCGA CAGAAGCACC ATGTCCTTGG GTCCGG CCTG CTGAATGCGC 
9781 AGGCGGTCGG CCATGCCCCA GGCTTCGTTT TGACATCGGC GCAGGTCTTT GTAGTAGTCT 
9841 TGCATGAGCC TTTCTACCGG CACTTCTTCT TCTCCTTCCT CTTGTCCTGC ATCTCTTGCA 
9901 TCTATCGCTG CGGCGGCGGC GGAGTTTGGC CGTAGGTGGC GCCCTCTTCC TCCCATGCGT 
9961 GTGACCCCGA AGCCCCTCAT CGGCTGAAGC AGGGCTAGGT CGGCGACAAC GCGCTCGGCT 
10021 AATATGGCCT GCTGCACCTG CGTGAGGGTA GACTGGAAGT CATCCATGTC CACAAAGCGG 
10081 TGGTATGCGC CCGTGTTGAT GGTGTAAGTG CAGTTGGCCA TAACGGACCA GTTAACGGTC 
10141 TGGTGACCCG GCTGCGAGAG CTCGGTGTAC CTGAGACGCG AGTAAGCCCT CGAGTCAAAT 
10201 ACGTAGTCGT TGCAAGTCCG CACCAGGTAC TGGTATCCCA CCAAAAAGTG CGGCGGCGGC 
10261 TGGCGGTAGA GGGGCCAGCG TAGGGTGGCC GGGGCTCCGG GGGCGAGATC TTCCAACATA 
10321 AGGCGATGAT ATCCGTAGAT GTACCTGGAC ATCCAGGTGA TGCCGGCGGC GGTGGTGGAG 
10381 GCGCGCGGAA AGTCGCGGAC GCGGTTCCAG ATGTTGCGCA GCGGCAAAAA GTGCTCCATG 
10441 GTCGGGACGC TCTGGCCGGT CAGGCGCGCG CAATCGTTGA CGCTCTAGCG TGCAAAAGGA 
10501 GAGCCTGTAA GCGGGCACTC TTCCGTGGTC TGGTGGATAA ATTCGCAAGG GTATCATGGC 
10561 GGACGACCGG GGTTCGAGCC CCGTATCCGG CCGTCCGCCG TGATCCATGC GGTTACCGCC 
10621 CGCGTGTCGA ACCCAGGTGT GCGACGTCAG ACAACGGGGG AGTGCTCCTT TTGGCTTCCT 
10681 TCCAGGCGCG GCGGCTGCTG CGCTAGCTTT TTTGGCCACT GGCCGCGCGC AGCGTAAGCG 
10741 GTTAGGCTGG AAAGCGAAAG CATTAAGTGG CTCGCTCCCT GTAGCCGGAG GGTTATTTTC 
10801 CAAGGGTTGA GTCGCGGGAC CCCCGGTTCG AGTCTCGGAC CGGCCGGACT GCGGCGAACG 
10861 GGGGTTTGCC TCCCCGTCAT GCAAGACCCC GCTTGCAAAT TCCTCCGGAA ACAGGGACGA 
10921 GCCCCTTTTT TGCTTTTCCC AGATGCATCC GGTGCTGCGG CAGATGCGCC CCCCTCCTCA 
10981 GCAGCGGCAA GAGCAAGAGC AGCGGCAGAC ATGCAGGGCA CCCTCCCCTC CTCCTACCGC 
11041 GTCAGGAGGG GCGACATCCG CGGTTGACGC GGCAGCAGAT GGTGATTACG AACCCCCGCG 
11101 GCGCCGGGCC CGGCACTACC TGGACTTGGA GGAGGGCGAG GGCCTGGCGC GGCTAGGAGC 
11161 GCCCTCTCCT GAGCGGTACC CAAGGGTGCA GCTGAAGCGT GATACGCGTG AGGCGTACGT 
11221 GCCGCGGCAG AACCTGTTTC GCGACCGCGA GGGAGAGGAG CCCGAGGAGA TGCGGGATCG 
11281 AAAGTTCCAC GCAGGGCGCG AGCTGCGGCA TGGCCTGAAT CGCGAGCGGT TGCTGCGCGA 
11341 GGAGGACTTT GAGCCCGACG CGCGAACCGG GATTAGTCCC GCGCGCGCAC ACGTGGCGGC 
11401 CGCCGACCTG GTAACCGCAT ACGAGCAGAC GGTGAACCAG GAGATTAACT TTCAAAAAAG 
11461 CTTTAACAAC CACGTGCGTA CGCTTGTGGC GCGCGAGGAG GTGGCTATAG GACTGATGCA 
11521 TCTGTGGGAC TTTGTAAGCG CGCTGGAGCA AAACCCAAAT AGCAAGCCGC TCATGGCGCA 
11581 GCTGTTCCTT ATAGTGCAGC ACAGCAGGGA CAACGAGGCA TTCAGGGATG CGCTGCTAAA 
11641 CATAGTAGAG CCCGAGGGCC GCTGGCTGCT CGATTTGATA AACATCCTGC AGAGCATAGT 
11701 GGTGCAGGAG CGCAGCTTGA GCCTGGCTGA CAAGGTGGCC GCCATCAACT ATTCCATGCT 
11761 TAGCCTGGGC AAGTTTTACG CCCGCAAGAT ATACCATACC CCTTACGTTC CCATAGACAA 
11821 GGAGGTAAAG ATCGAGGGGT TCTACATGCG CATGGCGCTG AAGGTGCTTA CCTTGAGCGA 
11881 CGACCTGGGC GTTTATCGCA ACGAGCGCAT CCACAAGGCC GTGAGCGTGA GCCGGCGGCG 
11941 CGAGCTCAGC GACCGCGAGC TGATGCACAG CCTGCAAAGG GCCCTGGCTG GCACGGGCAG 
12001 CGGCGATAGA GAGGCCGAGT CCTACTTTGA CGCGGGCGCT GACCTGCGCT GGGCCCCAAG 
12061 CCGACGCGCC CTGGAGGCAG CTGGGGCCGG ACCTGGGCTG GCGGTGGCAC CCGCGCGCGC 
12121 TGGCAACGTC GGCGGCGTGG AGGAATATGA CGAGGACGAT GAGTACGAGC CAGAGGACGG 
12181 CGAGTACTAA GCGGTGATGT TTCTGATCAG ATGATGCAAG ACGCAACGGA CCCGGCGGTG 
12241 CGGGCGGCGC TGCAGAGCCA GCCGTCCGGC CTTAACTCCA CGGACGACTG GCGCCAGGTC 
12301 ATGGACCGCA TCATGTCGCT GACTGCGCGC AATCCTGACG CGTTCCGGCA GCAGCCGCAG 
12361 GCCAACCGGC TCTCCGCAAT TCTGGAAGCG GTGGTCCCGG CGCGCGCAAA CCCCACGCAC 
12421 GAGAAGGTGC TGGCGATCGT AAACGCGCTG GCCGAAAACA GGGCCATCCG GCCCGACGAG 
12481 GCCGGCCTGG TCTACGACGC GCTGCTTCAG CGCGTGGCTC GTTACAACAG CGGCAACGTG 
12541 CAGACCAACC TGGACCGGCT GGTGGGGGAT GTGCGCGAGG CCGTGGCGCA GCGTGAGCGC 
12601 GCGCAGCAGC AGGGCAACCT GGGCTCCATG GTTGCACTAA ACGCCTTCCT GAGTACACAG 
12661 CCCGCCAACG TGCCGCGGGG ACAGGAGGAC TACACCAACT TTGTGAGCGC ACTGCGGCTA 
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12721 ATGGTGACTG AGACACCGCA AAGTGAGGTG TACCAGTCTG GGCCAGACTA TTTTTTCCAG 
12781 ACCAGTAGAC AAGGCCTGCA GACCGTAAAC CTGAGCCAGG CTTTCAAAAA CTTGCAGGGG 
12841 CTGTGGGGGG TGCGGGCTCC CACAGGCGAC CGCGCGACCG TGTCTAGCTT GCTGACGCCC 
12901 AACTCGCGCC TGTTGCTGCT GCTAATAGCG CCCTTCACGG ACAGTGGCAG CGTGTCCCGG 
12961 GACACATACC TAGGTCACTT GCTGACACTG TACCGCGAGG CCATAGGTCA GGCGCATGTG 
13021 GACGAGCATA CTTTCCAGGA GATTACAAGT GTCAGCCGCG CGCTGGGGCA GGAGGACACG 
13081 GGCAGCCTGG AGGCAACCCT AAACTACCTG CTGACCAACC GGCGGCAGAA GATCCCCTCG 
13141 TTGCACAGTT TAAACAGCGA GGAGGAGCGC ATTTTGCGCT ACGTGCAGCA GAGCGTGAGC 
13201 CTTAACCTGA TGCGCGACGG GGTAACGCCC AGCGTGGCGC TGGACATGAC CGCGCGCAAC 
13261 ATGGAACCGG GCATGTATGC CTCAAACCGG CCGTTTATCA ACCGCCTAAT GGACTACTTG 
13321 CATCGCGCGG CCGCCGTGAA CCCCGAGTAT TTCACCAATG CCATCTTGAA CCCGCACTGG 
13381 CTACCGCCCC CTGGTTTCTA CACCGGGGGA TTCGAGGTGC CCGAGGGTAA CGATGGATTC 
13441 CTCTGGGACG ACATAGACGA CAGGGTGTTT TCCCCGCAAC CGCAGACCCT GCTAGAGTTG 
13 501 CAACAGCGCG AGCAGGCAGA GGCGGCGCTG CGAAAGGAAA GCTTCCGCAG GCCAAGCAGC 
13561 TTGTCCGATC TAGGCGCTGC GGCCCCGCGG TCAGATGCTA GTAGCCCATT TCCAAGCTTG 
13621 ATAGGGTCTC TTACCAGCAC TCGCACCACC CGCCCGCGCC TGCTGGGCGA GGAGGAGTAC 
13681 CTAAACAACT CGCTGCTGCA GCCGCAGCGC GAAAAAAACC TGCCTCCGGC ATTTCCCAAC 
13741 AACGGGATAG AGAGCCTAGT GGACAAGATG AGTAGATGGA AGACGTACGC GCAGGAGCAC 
L3801 AGGGACGTGC CAGGCCCGCG CCCGCCCACC CGTCGTCAAA GGCACGACCG TCAGCGGGGT 
L3861 CTGGTGTGGG AGGACGATGA CTCGGCAGAC GACAGCAGCG TCCTGGATTT GGGAGGGAGT 
L3921 GGCAACCCGT TTGCGCACCT TCGCCCCAGG CTGGGGAGAA TGTTTTAAAA AAAAAAAAGC 
L3981 ATGATGCAAA ATAAAAAACT CACCAAGGCC ATGGCACCGA GCGTTGGTTT TCTTGTATTC 
L4041 CCCTTAGTAT GCGGCGCGCG GCGATGTATG AGGAAGGTCC TCCTCCCTCC TACGAGAGTG 
L4101 TGGTGAGCGC GGCGCCAGTG GCGGCGGCGC TGGGTTCTCC CTTCGATGCT CCCCTGGACC 
L4161 CGCCGTTTGT GCCTCCGCGG TACCTGCGGC CTACCGGGGG GAGAAACAGC ATCCGTTACT 
L4221 CTGAGTTGGC ACCCCTATTC GACACCACCC GTGTGTACCT GGTGGACAAC AAGTCAACGG 
L4281 ATGTGGCATC CCTGAACTAC CAGAACGACC ACAGCAACTT TCTGACCACG GTCATTCAAA 
14341 ACAATGACTA CAGCCCGGGG GAGGCAAGCA CACAGACCAT CAATCTTGAC GACCGGTCGC 
L4401 ACTGGGGCGG CGACCTGAAA ACCATCCTGC ATACCAACAT GCCAAATGTG AACGAGTTCA 
L4461 TGTTTACCAA TAAGTTTAAG GCGCGGGTGA TGGTGTCGCG CTTGCCTACT AAGGACAATC 
L4521 AGGTGGAGCT GAAATACGAG TGGGTGGAGT TCACGCTGCC CGAGGGCAAC TACTCCGAGA 
14581 CCATGACCAT AGACCTTATG AACAACGCGA TCGTGGAGCA CTACTTGAAA GTGGGCAGAC 
L4641 AGAACGGGGT TCTGGAAAGC GACATCGGGG TAAAGTTTGA CACCCGCAAC TTCAGACTGG 
L4701 GGTTTGACCC CGTCACTGGT CTTGTCATGC CTGGGGTATA TACAAACGAA GCCTTCCATC 
L4761 CAGACATCAT TTTGCTGCCA GGATGCGGGG TGGACTTCAC CCACAGCCGC CTGAGCAACT 
L4821 TGTTGGGCAT CCGCAAGCGG CAACCCTTCC AGGAGGGCTT TAGGATCACC TACGATGATC 
L4BB1 TGGAGGGTGG TAACATTCCC GCACTGTTGG ATGTGGACGC CTACCAGGCG AGCTTGAAAG 
L4941 ATGACACCGA ACAGGGCGGG GGTGGCGCAG GCGGCAGCAA CAGCAGTGGC AGCGGCGCGG 
L5001 AAGAGAACTC CAACGCGGCA GCCGCGGCAA TGCAGCCGGT GGAGGACATG AACGATCATG 
L5061 CCATTCGCGG CGACACCTTT GCCACACGGG CTGAGGAGAA GCGCGCTGAG GCCGAAGCAG 
15121 CGGCCGAAGC TGCCGCCCCC GCTGCGCAAC CCGAGGTCGA GAAGCCTCAG AAGAAACCGG 
1.5 181 TGATCAAACC CCTGACAGAG GACAGCAAGA AACGCAGTTA CAACCTAATA AGCAATGACA 
L5241 GCACCTTCAC CCAGTACCGC AGCTGGTACC TTGCATACAA CTACGGCGAC CCTCAGACCG 
L5301 GAATCCGCTC ATGGACCCTG CTTTGCACTC CTGACGTAAC CTGCGGCTCG GAGCAGGTCT 
L5361 ACTGGTCGTT GCCAGACATG ATG CAAGACC CCGTGACCTT CCGCTCCACG CGCCAGATCA 
L5421 GCAACTTTCC GGTGGTGGGC GCCGAGCTGT TGCCCGTGCA CTCCAAGAGC TTCTACAACG 
L5481 ACCAGGCCGT CTACTCCCAA CTCATCCGCC AGTTTACCTC TCTGACCCAC GTGTTCAATC 
L5541 GCTTTCCCGA GAACCAGATT TTGGCGCGCC CGCCAGCCCC CACCATCACC ACCGTCAGTG 
15601 AAAACGTTCC TGCTCTCACA GATCACGGGA CGCTACCGCT GCGCAACAGC ATCGGAGGAG 
L5661 TCCAGCGAGT GACCATTACT GACGCCAGAC GCCGCACCTG CCCCTACGTT TACAAGGCCC 
L5721 TGGGCATAGT CTCGCCGCGC GTCCTATCGA GCCGCACTTT TTGAGCAAGC ATGTCCATCC 
L57B1 TTATATCGCC CAGCAATAAC ACAGGCTGGG GCCTGCGCTT CCCAAGCAAG ATGTTTGGCG 
L5841 GGGCCAAGAA GCGCTCCGAC CAACACCCAG TGCGCGTGCG CGGGCACTAC CGCGCGCCCT 
L5901 GGGGCGCGCA CAAACGCGGC CGCACTGGGC GCACCACCGT CGATGACGCC ATCGACGCGG 
L5961 TGGTGGAGGA GGCGCGCAAC TACACGCCCA CGCCGCCACC AGTGTCCACA GTGGACGCGG 
L6021 CCATTCAGAC CGTGGTGCGC GGAGCCCGGC GCTATGCTAA AATGAAGAGA CGGCGGAGGC 
L60B1 GCGTAGCACG TCGCCACCGC CGCCGACCCG GCACTGCCGC CCAACGCGCG GCGGCGGCCC 
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16141 TGCTTAACCG CGCACGTCGC ACCGGCCGAC GGGCGGCCAT GCGGGCCGCT CGAAGGCTGG 
1G201 CCGCGGGTAT TGTCACTGTG CCCCCCAGGT CCAGGCGACG AGCGGCCGCC GCAGCAGCCG 
16261 CGGCCATTAG TGCTATGACT CAGGGTCGCA GGGGCAACGT GTATTGGGTG CGCGACTCGG 
16321 TTAGCGGCCT GCGCGTGCCC GTGCGCACCC GCCCCCCGCG CAACTAGATT GCAAGAAAAA 
16381 ACTACTTAGA CTCGTACTGT TGTATGTATC CAGCGGCGGC GGCGCGCAAC GAAGCTATGT 
16441 CCAAGCGCAA AATCAAAGAA GAGATGCTCC AGGTCATCGC GCCGGAGATC TATGGCCCCC 
16501 CGAAGAAGGA AGAGCAGGAT TACAAGCCCC GAAAGCTAAA GCGGGTCAAA AAGAAAAAGA 
16561 AAGATGATGA TGATGAACTT GACGACGAGG TGGAACTGCT GCACGCTACC GCGCCCAGGC 
16621 GACGGGTACA GTGGAAAGGT CGACGCGTAA AACGTGTTTT GCGACCCGGC ACCACCGTAG 
16681 TCTTTACGCC CGGTGAGCGC TCCACCCGCA CCTACAAGCG CGTGTATGAT GAGGTGTACG 
16741 GCGACGAGGA CCTGCTTGAG CAGGCCAACG AGCGCCTCGG GGAGTTTGCC TACGGAAAGC 
16801 GGCATAAGGA CATGCTGGCG TTGCCGCTGG ACGAGGGCAA CCCAACACCT AGCCTAAAGC 
16861 CCGTAACACT GCAGCAGGTG CTGCCCGCGC TTGCACCGTC CGAAGAAAAG CGCGGCCTAA 
16921 AGCGCGAGTC TGGTGACTTG GCACCCACCG TGCAGCTGAT GGTACCCAAG CGCCAGCGAC 
16981 TGGAAGATGT CTTGGAAAAA ATGACCGTGG AACCTGGGCT GGAGCCCGAG GTCCGCGTGC 
17041 GGCCAATCAA GCAGGTGGCG CCGGGACTGG GCGTGCAGAC CGTGGACGTT CAGATACCCA 
17101 CTACCAGTAG CACCAGTATT GCCACCGCCA CAGAGGGCAT GGAGACACAA ACGTCCCCGG 
17161 TTGCCTCAGC GGTGGCGGAT GCCGCGGTGC AGGCGGTCGC TGCGGCCGCG TCCAAGACCT 
17221 CTACGGAGGT GCAAACGGAC CCGTGGATGT TTCGCGTTTC AGCCCCCCGG CGCCCGCGCG 
17281 GTTCGAGGAA GTACGGCGCC GCCAGCGCGC TACTGCCCGA ATATGCCCTA CATCCTTCCA 
17341 TTGCGCCTAC CCCCGGCTAT CGTGGCTACA CCTACCGCCC CAGAAGACGA GCAACTACCC 
17401 GACGCCGAAC CACCACTGGA ACCCGCCGCC GCCGTCGCCG TCGCCAGCCC GTGCTGGCCC 
17461 CGATTTCCGT GCGCAGGGTG GCTCGCGAAG GAGGCAGGAC CCTGGTGCTG CCAACAGCGC 
17521 GCTACCACCC CAGCATCGTT TAAAAGCCGG TCTTTGTGGT TCTTGCAGAT ATGGCCCTCA 
17581 CCTGCCGCCT CCGTTTCCCG GTGCCGGGAT TCCGAGGAAG AATGCACCGT AGGAGGGGCA 
17641 TGGCCGGCCA CGGCCTGACG GGCGGCATGC GTCGTGCGCA CCACCGGCGG CGGCGCGCGT 
17701 CGCACCGTCG CATGCGCGGC GGTATCCTGC CCCTCCTTAT TCCACTGATC GCCGCGGCGA 
17761 TTGGCGCCGT GCCCGGAATT GCATCCGTGG CCTTGCAGGC GCAGAGACAC TGATTAAAAA 
17821 CAAGTTGCAT GTGGAAAAAT CAAAATAAAA AGTCTGGACT CTCACGCTCG CTTGGTCCTG 
17881 TAACTATTTT GTAGAATGGA AGACATCAAC TTTGCGTCTC TGGCCCCGCG ACACGGCTCG 
17941 CGCCCGTTCA TGGGAAACTG GCAAGATATC GGCACCAGCA ATATGAGCGG TGGCGCCTTC 
18001 AGCTGGGGCT CGCTGTGGAG CGGCATTAAA AATTTCGGTT CCACCGTTAA GAACTATGGC 
18061 AGCAAGGCCT GGAACAGCAG CACAGGCCAG ATGCTGAGGG ATAAGTTGAA AGAGCAAAAT 
18121 TTCCAACAAA AGGTGGTAGA TGGCCTGGCC TCTGGCATTA GCGGGGTGGT GGACCTGGCC 
18181 AACCAGGCAG TGCAAAATAA GATTAACAGT AAGCTTGATC CCCGCCCTCC CGTAGAGGAG 
18241 CCTCCACCGG CCGTGGAGAC AGTGTCTCCA GAGGGGCGTG GCGAAAAGCG TCCGCGCCCC 
18301 GACAGGGAAG AAACTCTGGT GACGCAAATA GACGAGCCTC CCTCGTACGA GGAGGCACTA 
18361 AAGCAAGGCC TGCCCACCAC CCGTCCCATC GCGCCCATGG CTACCGGAGT GCTGGGCCAG 
18421 CACACACCCG TAACGCTGGA CCTGCCTCCC CCCGCCGACA CCCAGCAGAA ACCTGTGCTG 
18481 CCAGGCCCGA CCGCCGTTGT TGTAACCCGT CCTAGCCGCG CGTCCCTGCG CCGCGCCGCC 
18541 AGCGGTCCGC GATCGTTGCG GCCCGTAGCC AGTGGCAACT GGCAAAGCAC ACTGAACAGC 
18601 ATCGTGGGTC TGGGGGTGCA ATCCCTGAAG CGCCGACGAT GCTTCTGAAT AGCTAACGTG 
18661 TCGTATGTGT GTCATGTATG CGTCCATGTC GCCGCCAGAG GAGCTGCTGA GCCGCCGCGC 
18721 GCCCGCTTTC CAAGATGGCT ACCCCTTCGA TGATGCCGCA GTGGTCTTAC ATGCACATCT 
18781 CGGGCCAGGA CGCCTCGGAG TACCTGAGCC CCGGGCTGGT GCAGTTTGCC CGCGCCACCG 
18841 AGACGTACTT CAGCCTGAAT AACAAGTTTA GAAACCCCAC GGTGGCGCCT ACGCACGACG 
18901 TGACCACAGA CCGGTCCCAG CGTTTGACGC TGCGGTTCAT CCCTGTGGAC CGTGAGGATA 
18961 CTGCGTACTC GTACAAGGCG CGGTTCACCC TAGCTGTGGG TGATAACCGT GTGCTGGACA 
19021 TGGCTTCCAC GTACTTTGAC ATCCGCGGCG TGCTGGACAG GGGCCCTACT TTTAAGCCCT 
19081 ACTCTGGCAC TGCCTACAAC GCCCTGGCTC CCAAGGGTGC CCCAAATCCT TGCGAATGGG 
19141 ATGAAGCTGC TACTGCTCTT GAAATAAACC TAGAAGAAGA GGACGATGAC AACGAAGACG 
19201 AAGTAGACGA GCAAGCTGAG CAGCAAAAAA CTCACGTATT TGGGCAGGCG CCTTATTCTG 
19261 GTATAAATAT TACAAAGGAG GGTATTCAAA TAGGTGTCGA AGGTCAAACA CCTAAATATG 
19321 CCGATAAAAC ATTTCAACCT GAACCTCAAA TAGGAGAATC TCAGTGGTAC GAAACTGAAA 
19381 TTAATCATGC AGCTGGGAGA GTCCTTAAAA AGACTACCCC AATGAAACCA TGTTACGGTT 
19441 CATATGCAAA ACCCACAAAT GAAAATGGAG GGCAAGGCAT TCTTGTAAAG CAACAAAATG 
19501 GAAAGCTAGA AAGTCAAGTG GAAATGCAAT TTTTCTCAAC TACTGAGGCG ACCGCAGGCA 
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19561 ATGGTGATAA CTTGACTCCT AAAGTGGTAT TGTACAGTGA AGATGTAGAT ATAGAAACCC 
19S21 CAGACACTCA TATTTCTTAC ATGCCCACTA TTAAGGAAGG TAACTCACGA GAACTAATGG 
19681 GCCAACAATC TATGCCCAAC AGGCCTAATT ACATTGCTTT TAGGGACAAT TTTATTGGTC 
19741 TAATGTATTA CAACAGCACG GGTAATATGG GTGTTCTGGC GGGCCAAGCA TCGCAGTTGA 
19801 ATGCTGTTGT AGATTTGCAA GACAGAAACA CAGAGCTTTC ATACCAGCTT TTGCTTGATT 
19861 CCATTGGTGA TAGAACCAGG TACTTTTCTA TGTGGAATCA GGCTGTTGAC AGCTA TGAT C 
19921 CAGATGTTAG AATTATTGAA AATCATGGAA CTGAAGATGA ACTTCCAAAT TACTGCTTTC 
19981 CACTGGGAGG TGTGATTAAT ACAGAGACTC TTACCAAGGT AAAACCTAAA ACAGGTCAGG 
20041 AAAATGGATG GGAAAAAGAT GCTACAGAAT TTTCAGATAA AAATGAAATA AGAGTTGGAA 
20101 ATAATTTTGC CATGGAAATC AATCTAAATG CCAACCTGTG GAGAAATTTC CTGTACTCCA 
20161 ACATAGCGCT GTATTTGCCC GACAAGCTAA AGTACAGTCC TTCCAACGTA AAAATTTCTG 
20221 ATAACCCAAA CACCTACGAC TACATGAACA AGCGAGTGGT GGCTCCCGGG TTAGTGGACT 
20281 GCTACATTAA CCTTGGAGCA CGCTGGTCCC TTGACTATAT GGACAACGTC AACCCATTTA 
20341 ACCACCACCG CAATGCTGGC CTGCGCTACC GCTCAATGTT GCTGGGCAAT GGTCGCTATG 
20401 TGCCCTTCCA CATCCAGGTG CCTCAGAAGT TCTTTGCCAT TAAAAACCTC CTTCTCCTGC 
20461 CGGGCTCATA CACCTACGAG TGGAACTTCA GGAAGGATGT TAACATGGTT CTGCAGAGCT 
20521 CCCTAGGAAA TGACCTAAGG GTTGACGGAG CCAGCATTAA GTTTGATAGC ATTTGCCTTT 
20581 ACGCCACCTT CTTCCCCATG GCCCACAACA CCGCCTCCAC GCTTGAGGCC ATGCTTAGAA 
20641 ACGACACCAA CGACCAGTCC TTTAACGACT ATCTCTCCGC CGCCAACATG CTCTACCCTA 
20701 TACCCGCCAA CGCTACCAAC GTGCCCATAT CCATCCCCTC CCGCAACTGG GCGGCTTTCC 
20761 GCGGCTGGGC CTTCACGCGC CTTAAGACTA AGGAAACCCC ATCACTGGGC TCGGGCTACG 
20821 ACCCTTATTA CACCTACTCT GGCTCTATAC CCTACCTAGA TGGAACCTTT TACCTCAACC 
20881 ACACCTTTAA GAAGGTGGCC ATTACCTTTG ACTCTTCTGT CAGCTGGCCT GGCAATGACC 
20941 GCCTGCTTAC CCCCAACGAG TTTGAAATTA AGCGCTCAGT TGACGGGGAG GGTTACAACG 
21001 TTGCCCAGTG TAACATGACC AAAGACTGGT TCCTGGTACA AATGCTAGCT AACTACAACA 
21061 TTGGCTACCA GGGCTTCTAT ATCCCAGAGA GCTACAAGGA CCGCATGTAC TCCTTCTTTA 
21121 GAAACTTCCA GCCCATGAGC CGTCAGGTGG TGGATGATAC TAAATACAAG GACTACCAAC 
21181 AGGTGGGCAT CCTACACCAA CACAACAACT CTGGATTTGT TGGCTACCTT GCCCCCACCA 
21241 TGCGCGAAGG ACAGGCCTAC CCTGCTAACT TCCCCTATCC GCTTATAGGC AAGACCGCAG 
21301 TTGACAGCAT TACCCAGAAA AAG TTTCT TT GCGATCGCAC CCTTTGGCGC ATCCCATTCT 
21361 CCAGTAACTT TATGTCCATG GGCGCACTCA CAGACCTGGG CCAAAACCTT CTCTACGCCA 
21421 ACTCCGCCCA CGCGCTAGAC ATGACTTTTG AGGTGGATCC CATGGACGAG CCCACCCTTC 
21481 TTTATGTTTT GTTTGAAGTC TTTGACGTGG TCCGTGTGCA CCGGCCGCAC CGCGGCGTCA 
21541 TCGAAACCGT GTACCTGCGC ACGCCCTTCT CGGCCGGCAA CGCCACAACA TAAAGAAGCA 
21601 AGCAACATCA ACAACAGCTG CCGCCATGGG CTCCAGTGAG CAGGAACTGA AAGCCATTGT 
21661 CAAAGATCTT GGTTGTGGGC CATATTTTTT GGGCACCTAT GACAAGCGCT TTCCAGGCTT 
21721 TGTTTCTCCA CACAAGCTCG CCTGCGCCAT AGTCAATACG GCCGGTCGCG AGACTGGGGG 
21781 CGTACACTGG ATGGCCTTTG CCTGGAACCC GCACTCAAAA ACATGCTACC TCTTTGAGCC 
21841 CTTTGGCTTT TCTGACCAGC GACTCAAGCA GGTTTACCAG TTTGAGTACG AGTCACTCCT 
21901 GCGCCGTAGC GCCATTGCTT CTTCCCCCGA CCGCTGTATA ACGCTGGAAA AGTCCACCCA 
21961 AAGCGTACAG GGGCCCAACT CGGCCGCCTG TGGACTATTC TGCTGCATGT TTCTCCACGC 
22021 CTTTGCCAAC TGGCCCCAAA CTCCCATGGA TCACAACCCC ACCATGAACC TTATTACCGG 
22081 GGTACCCAAC TCCATGCTCA ACAGTCCCCA GGTACAGCCC ACCCTGCGTC GCAACCAGGA 
22141 ACAGCTCTAC AGCTTCCTGG AGCGCCACTC GCCCTACTTC CGCAGCCACA GTGCGCAGAT 
22201 TAGGAGCGCC ACTTCTTTTT GTCACTTGAA AAACATGTAA AAATAATGTA CTAGAGACAC 
22261 TTTCAATAAA GGCAAATGCT TTTATTTGTA CACTCTCGGG TGATTATTTA CCCCCACCCT 
22321 TGCCGTCTGC GCCGTTTAAA AATCAAAGGG GTTCTGCCGC GCATCGCTAT GCGCCACTGG 
22381 CAGGGACACG TTGCGATACT GGTGTTTAGT GCTCCACTTA AACTCAGGCA CAACCATCCG 
22441 CGGCAGCTCG GTGAAGTTTT CACTCCACAG GCTGCGCACC ATCACCAACG CGTTTAGCAG 
22501 GTCGGGCGCC GATATCTTGA AGTCGCAGTT GGGGCCTCCG CCCTGCGCGC GCGAGTTGCG 
22561 ATACACAGGG TTGCAGCACT GGAACACTAT CAGCGCCGGG TGGTGCACGC TGGCCAGCAC 
22621 GCTCTTGTCG GAGATCAGAT CCGCGTCCAG GTCCTCCGCG TTGCTCAGGG CGAACGGAGT 
22681 CAACTTTGGT AGCTGCCTTC CCAAAAAGGG CGCGTGCCCA GGCTTTGAGT TGCACTCGCA 
22741 CCGTAGTGGC ATCAAAAGGT GACCGTGCCC GGTCTGGGCG TTAGGATACA GCGCCTGCAT 
22801 AAAAGCCTTG ATCTGCTTAA AAGCCACCTG AGCCTTTGCG CCTTCAGAGA AGAACATGCC 
22861 GCAAGACTTG CCGGAAAACT GATTGGCCGG ACAGGCCGCG TCGTGCACGC AGCACCTTGC 
22921 GTCGGTGTTG GAGATCTGCA CCACATTTCG GCCCCACCGG TTCTTCACGA TCTTGGCCTT 
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22981 GCTAGACTGC TCCTTCAGCG CGCGCTGCCC GTTTTCGCTC GTCACATCCA TTTCAATCAC 
23041 GTGCTCCTTA TTTATCATAA TGCTTCCGTG TAGACACTTA AGCTCGCCTT CGATCTCAGC 
23101 GCAGCGGTGC AGCCACAACG CGCAGCCCGT GGGCTCGTGA TGCTTGTAGG TCACCTCTGC 
23161 AAACGACTGC AGGTACGCCT GCAGGAATCG CCCCATCATC GTCACAAAGG TCTTGTTGCT 
23221 GGTGAAGGTC AGCTGCAACC CGCGGTGCTC CTCGTTCAGC CAGGTCTTGC ATACGGCCGC 
232 Bl CAGAGCTTCC ACTTGGTCAG GCAGTAGTTT GAAGTTCGCC TTTAGATCGT TATCCACGTG 
23341 GTACTTGTCC ATCAGCGCGC GCGCAGCCTC CATGCCCTTC TCCCACGCAG ACACGATCGG 
23401 CACACTCAGC GGGTTCATCA CCGTAATTTC ACTTTCCGCT TCGCTGGGCT CTTCCTCTTC 
23461 CTCTTGCGTC CGCATACCAC GCGCCACTGG GTCGTCTTCA TTCAGCCGCC GCACTGTGCG 
23521 CTTACCTCCT TTGCCATGCT TGATTAGCAC CGGTGGGTTG CTGAAACCCA CCATTTGTAG 
23581 CGCCACATCT TCTCTTTCTT CCTCGCTGTC CACGATTACC TCTGGTGATG GCGGGCGCTC 
23641 GGGCTTGGGA GAAGGGCGCT TCTTTTTCTT CTTGGGCGCA ATGGCCAAAT CCGCCGCCOA 
23701 GGTCGATGGC CGCGGGCTGG GTGTGCGCGG CACCAGCGCG TCTTGTGATG AGTCTTCCTC 
23761 GTCCTCGGAC TCGATACGCC GCCTCATCCG CTTTTTTGGG GGCGCCCGGG GAGGCGGCGG 
23821 CGACGGGGAC GGGGACGACA CGTCCTCCAT GGTTGGGGGA CGTCGCGCCG CACCGCGTCC 
23881 GCGCTCGGGG GTGGTTTCGC GCTGCTCCTC TTCCCGACTG GCCATTTCCT TCTCCTATAG 
23941 GCAGAAAAAG ATCATGGAGT CAGTCGAGAA GAAGGACAGC CTAACCGCCC CCTCTGAGTT 
24001 CGCCACCACC GCCTCCACCG ATGCCGCCAA CGCGCCTACC ACCTTCCCCG TCGAGGCACC 
24061 CCCGCTTGAG GAGGAGGAAG TGATTATCGA GCAGGACCCA GGTTTTGTAA GCGAAGACGA 
24121 CGAGGACCGC TCAGTACCAA CAGAGGATAA AAAGCAAGAC CAGGACAACG CAGAGGCAAA 
24181 CGAGGAACAA GTCGGGCGGG GGGACGAAAG GCATGGCGAC TACCTAGATG TGGGAGACGA 
24241 CGTGCTGTTG AAGCATCTGC AGCGCCAGTG CGCCATTATC TGCGACGCGT TGCAAGAGCG 
24301 CAGCGATGTG CCCCTCGCCA TAGCGGATGT CAGCCTTGCC TACGAACGCC ACCTATTCTC 
24361 ACCGCGCGTA CCCCCCAAAC GCCAAGAAAA CGGCACATGC GAGCCCAACC CGCGCCTCAA 
24421 CTTCTACCCC GTATTTGCCG TGCCAGAGGT GCTTGCCACC TATCACATCT TTTTCCAAAA 
24481 CTGCAAGATA CCCCTATCCT GCCGTGCCAA CCGCAGCCGA GCGGACAAGC AGCTGGCCTT 
24541 GCGGCAGGGC GCTGTCATAC CTGATATCGC CTCGCTCAAC GAAGTGCCAA AAATCTTTGA 
24601 GGGTCTTGGA CGCGACGAGA AGCGCGCGGC AAACGCTCTG CAACAGGAAA ACAGCGAAAA 
24661 TGAAAGTCAC TCTGGAGTGT TGGTGGAACT CGAGGGTGAC AACGCGCGCC TAGCCGTACT 
24721 AAAACGCAGC ATCGAGGTCA CCCACTTTGC CTACCCGGCA CTTAACCTAC CCCCCAAGGT 
24781 CATGAGCACA GTCATGAGTG AGCTGATCGT GCGCCGTGCG CAGCCCCTGG AGAGGGATGC 
24841 AAATTTGCAA GAACAAACAG AGGAGGGCCT ACCCGCAGTT GGCGACGAGC AGCTAGCGCG 
24901 CTGGCTTCAA ACGCGCGAGC CTGCCGACTT GGAGGAGCGA CGCAAACTAA TGATGGCCGC 
24961 AGTGCTCGTT ACCGTGGAGC TTGAGTGCAT GCAGCGGTTC TTTGCTGACC CGGAGATGCA 
25021 GCGCAAGCTA GAGGAAACAT TGCACTACAC CTTTCGACAG GGCTACGTAC GCCAGGCCTG 
25081 CAAGATCTCC AACGTGGAGC TCTGCAACCT GGTCTCCTAC CTTGGAATTT TGCACGAAAA 
25141 CCGCCTTGGG CAAAACGTGC TTCATTCCAC GCTCAAGGGC GAGGCGCGCC GCGACTACGT 
25201 CCGCGACTGC GTTTACTTAT TTCTATGCTA CACCTGGCAG ACGGCCATGG GCGTTTGGCA 
25261 GCAGTGCTTG GAGGAGTGCA ACCTCAAGGA GCTGCAGAAA CTGCTAAAGC AAAACTTGAA 
25321 GGACCTATGG ACGGCCTTCA ACGAGCGCTC CGTGGCCGCG CACCTGGCGG ACATCATTTT 
25381 CCCCGAACGC CTGCTTAAAA CCCTGCAACA GGGTCTGCCA GACTTCACCA GTCAAAGCAT 
25441 GTTGCAGAAC TTTAGGAACT TTATCCTAGA GCGCTCAGGA ATCTTGCCCG CCACCTGCTG 
25501 TGCACTTCCT AGCGACTTTG TGCCCATTAA GTACCGCGAA TGCCCTCCGC CGCTTTGGGG 
25561 CCACTGCTAC CTTCTGCAGC TAGCCAACTA CCTTGCCTAC CACTCTGACA TAATGGAAGA 
25621 CGTGAGCGGT GACGGTCTAC TGGAGTGTCA CTGTCGCTGC AACCTATGCA CCCCGCACCG 
25681 CTCCCTGGTT TGCAATTCGC AGCTGCTTAA CGAAAGTCAA ATTATCGGTA CCTTTGAGCT 
25741 GCAGGGTCCC TCGCCTGACG AAAAGTCCGC GGCTCCGGGG TTGAAACTCA CTCOGGGGCT 
25801 GTGGACGTCG GCTTACCTTC GCAAATTTGT ACCTGAGGAC TACCACGCCC ACGAGATTAG 
25861 GTTCTACGAA GACCAATCCC GCCCGCCAAA TGCGGAGCTT ACCGCCTGCG TCATTACCCA 
25921 GGGCCACATT CTTGGCCAAT TGCAAGCCAT CAACAAAGCC CGCCAAGAGT TTCTGCTACG 
25981 AAAGGGACGG GGGGTTTACT TGGACCCCCA GTCCGGCGAG GAGCTCAACC CAATCCCCCC 
26041 GCCGCCGCAG CCCTATCAGC AGCAGCCGCG GGCCCTTGCT TCCCAGGATG GCACCCAAAA 
26101 AGAAGCTGCA GCTGCCGCCG CCACCCACGG ACGAGGAGGA ATACTGGGAC AGTCAGGCAG 
26161 AGGAGGTTTT GGACGAGGAG GAGGAGGACA TGATGGAAGA CTGGGAGAGC CTAGACGAGG 
26221 AAGCTTCCGA GGTCGAAGAG GTGTCAGACG AAACACCGTC ACCCTCGGTC GCATTCCCCT 
26281 CGCCGGCGCC CCAGAAATCG GCAACCGGTT CCAGCATGGC TACAACCTCC GCTCCTCAGG 
26341 CGCCGCCGGC ACTGCCCGTT CGCCGACCCA ACCGTAGATG GGACACCACT GGAACCAGGG 
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26401 CCGGTAAGTC CAAGCAGCCG CCGCCGTTAG CCCAAGAGCA ACAACAGCGC CAAGGCTACC 
26461 GCTCATGGCG CGGGCACAAG AACGCCATAG TTGCTTGCTT GCAAGACTGT GGGGGCAACA 
26521 TCTCCTTCGC CCGCCGCTTT CTTCTCTACC ATCACGGCGT GGCCTTCCCC CGTAACATCC 
265B1 TGCATTACTA CCGTCATCTC TACAGCCCAT ACTGCACCGG CGGCAGCGGC AGCGGCAGCA 
26641 ACAGCAGCGG CCACACAGAA GCAAAGGCGA CCGGATAGCA AGACTCTGAC AAAGCCCAAG 
26701 AAATCCACAG CGGCGGCAGC AGCAGGAGGA GGAGCGCTGC GTCTGGCGCC CAACGAACCC 
26761 GTATCGACCC GCGAGCTTAG AAACAGGATT TTTCCCACTC TGTATGCTAT ATTTCAACAG 
26B21 AGCAGGGGCC AAGAACAAGA GCTGAAAATA AAAAACAGGT CTCTGCGATC CCTCACCCGC 
26681 AGCTGCCTGT ATCACAAAAG CGAAGATCAG CTTCGGCGCA CGCTGGAAGA CGCGGAGGCT 
26941 CTCTTCAGTA AATACTGCGC GCTGACTCTT AAGGACTAGT TTCGCGCCCT TTCTCAAATT 
27001 TAAGCGCGAA AACTACGTCA TCTCCAGCGG CCACACCCGG CGCCAGCACC TGTCGTCAGC 
27061 GCCATTATGA GCAAGGAAAT TCCCACGCCC TACATGTGGA GTTACCAGCC ACAAATGGGA 
27121 CTTGCGGCTG GAGCTGCCCA AGACTACTCA ACCCGAATAA ACTACATGAG CGCGGGACCC 
271B1 CACATGATAT CCCGGGTCAA CGGAATCCGC GCCCACCGAA ACCGAATTCT CTTGGAACAG 
27241 GCGGCTATTA CCACCACACC TCGTAATAAC CTTAATCCCC GTAGTTGGCC CGCTGCCCTG 
27301 GTGTACCAGG AAAGTCCCGC TCCCACCACT GTGGTACTTC CCAGAGACGC CCAGGCOGAA 
27361 GTTCAGATGA CTAACTCAGG GGCGCAGCTT GCGGGCGGCT TTCGTCACAG GGTGCGGTCG 
27421 CCCGGGCAGG GTATAACTCA CCTGACAATC AGAGGGCGAG GTATTCAGCT CAACGACGAG 
27481 TCGGTGAGCT CCTCGCTTGG TCTCCGTCCG GACGGGACAT TTCAGATCGG CGGCGCCGGC 
27541 CGTCCTTCAT TCACGCCTCG TCAGGCAATC CTAACTCTGC AGACCTCGTC CTCTGAGCCG 
27601 CGCTCTGGAG GCATTGGAAC TCTGCAATTT ATTGAGGAGT TTGTGCCATC GGTCTACTTT 
27661 AACCCCTTCT CGGGACCTCC CGGCCACTAT CCGGATCAAT TTATTCCTAA CTTTGACGCG 
27721 GTAAAGGACT CGGCGGACGG CTACGACTGA ATGTTAAGTG GAGAGGCAGA GCAACTGCGC 
27781 CTGAAACACC TGGTCCACTG TCGCCGCCAC AAGTGCTTTG CCCGCGACTC CGGTGAGTTT 
27841 TGCTACTTTG AATTGCCCGA GGATCATATC GAGGGCCCGG CGCACGGCGT CCGGCTTACC 
27901 GCCCAGGGAG AGCTTGCCCG TAGCCTGATT CGGGAGTTTA CCCAGCGCCC CCTGCTAGTT 
27961 GAGCGGGACA GGGGACCCTG TGTTCTCACT GTGATTTGCA ACTGTCCTAA CCTTGGATTA 
28021 CATCAAGATC TTTGTTGCCA TCTCTGTGCT GAGTATAATA AATACAGAAA TTAAAATATA 
28081 CTGGGGCTCC TATCGCCATC CTGTAAACGC CACCGTCTTC ACCCGCCCAA GCAAACCAAG 
28141 GCGAACCTTA CCTGGTACTT TTAACATCTC TCCCTCTGTG ATTTACAACA GTTTCAACCC 
28201 AGACGGAGTG AGTCTACGAG AGAACCTCTC CGAGCTCAGC TACTCCATCA GAAAAAACAC 
28261 CACCCTCCTT ACCTGCCGGG AACGTACGAG TGCGTCACCG GCCGCTGCAC CACACCTACC 
28321 GCCTGACCGT AAACCAGACT TTTTCCGGAC AGACCTCAAT AACTCTGTTT ACCAGAACAG 
28381 GAGGTGAGCT TAGAAAACCC TTAGGGTATT AGGCCAAAGG CGCAGCTACT GTGGGGTTTA 
28441 TGAACAATTC AAGCAACTCT ACGGGCTATT CTAATTCAGG TTTCTCTAGA AGTCAGGCTT 
28501 CCTGGATGTC AGCATCTGAC TTTGGCCAGC ACCTGTCCCG CGGATTTGTT CCAGTCCAAC 
28561 TACAGCGACC CACCCTAACA GAGATGACCA ACACAACCAA CGCGGCCGCC GCTACCGGAC 
28621 TTACATCTAC CACAAATACA CCCCAAGTTT CTGCCTTTGT CAATAACTGG GATAACTTGG 
28681 GCATGTGGTG GTTCTCCATA GCGCTTATGT TTGTATGCCT TATTATTATG TGGCTCATCT 
28741 GCTGCCTAAA GCGCAAACGC GCCCGACCAC CCATCTATAG TCCCATCATT GTGCTACACC 
28801 CAAACAATGA TGGAATCCAT AGATTGGACG GACTGAAACA CATGTTCTTT TCTCTTACAG 
28861 TATGATTAAA TGAGATCTAG AAATGGACGG AATTATTACA GAGCAGCGCC TGCTAGAAAG 
28921 ACGCAGGGCA GCGGCCGAGC AACAGCGCAT GAATCAAGAG CTCCAAGACA TGGTTAACTT 
28981 GCACCAGTGC AAAAGGGGTA TCTTTTGTCT GGTAAAGCAG GCCAAAGTCA CCTACGACAG 
29041 TAATACCACC GGACACCGCC TTAGCTACAA GTTGCCAACC AAGCGTCAGA AATTGGTGGT 
29101 CATGGTGGGA GAAAAGCCCA TTACCATAAC TCAGCACTCG GTAGAAACCG AAGGCTGCAT 
29161 TCACTCACCT TGTCAAGGAC CTGAGGATCT CTGCACCCTT ATTAAGACCC TGTGCGGTCT 
29221 CAAAGATCTT ATTCCCTTTA ACTAATAAAA AAAAATAATA AAGCATCACT TACTTAAAAT 
29281 CAGTTAGCAA ATTTCTGTCC AGTTTATTCA GCAGCACCTC CTTGCCCTCC TCCCAGCTCT 
29341 GGTATTGCAG CTTCCTCCTG GCTGCAAACT TTCTCCACAA TCTAAATGGA ATGTCAGTTT 
29401 CCTCCTGTTC CTGTCCATCC GCACCCACTA TCTTCATGTT GTTGCAGATG AAGCGCGCAA 
29461 GACCGTCTGA AGATACCTTC AACCCCGTGT ATCCATATGA CACGGAAACC GGTCCTCCAA 
29521 CTGTGCCTTT TCTTACTCCT CCCTTTGTAT CCCCCAATGG GTTTCAAGAG AGTCCCCCTG 
29581 GGGTACTCTC TTTGCGCCTA TCCGAACCTC TAGTTACCTC CAATGGCATG CTTGCGCTCA 
29641 AAATGGGCAA CGGCCTCTCT CTGGACGAGG CCGGCAACCT TACCTCCCAA AATGTAACCA 
29701 CTGTGAGCCC ACCTCTCAAA AAAACCAAGT CAAACATAAA CCTGGAAATA TCTGCACCCC 
29761 TCACAGTTAC CTCAGAAGCC CTAACTGTGG CTGCCGCCGC ACCTCTAATG GTCGCGGGCA 
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29821 ACACACTCAC CATGCAATCA CAGGCCCCGC TAACCGTGCA CGACTCCAAA CTTAGCATTG 
29881 CCACCCAAGG ACCCCTCACA GTGTCAGAAG GAAAGCTAGC CCTGCAAACA TCAGGCCCCC 
29941 TCACCACCAC CGATAGCAGT ACCCTTACTA TCACTGCCTC ACCCCCTCTA ACTACTGCCA 
30001 CTGGTAGCTT GGGCATTGAC TTGAAAGAGC CCATTTATAC ACAAAATGGA AAACTAGGAC 
30061 TAAAGTACGG GGCTCCTTTG CATGTAACAG ACGACCTAAA CACTTTGACC GTAGCAACTG 
30121 GTCCAGGTGT GACTATTAAT AATACTTCCT TGCAAACTAA AGTTACTGGA GCCTTGGGTT 
30181 TTGATTCACA AGGCAATATG CAACTTAATG TAGCAGGAGG ACTAAGGATT GATTCTCAAA 
30241 ACAGACGCCT TATACTTGAT GTTAGTTATC CGTTTGATGC TCAAAACCAA CTAAATCTAA 
30301 GACTAGGACA GGGCCCTCTT TTTATAAACT CAGCCCACAA CTTGGATATT AACTACAACA 
30361 AAGGCCTTTA CTTGTTTACA GCTTCAAACA ATTCCAAAAA GCTTGAGGTT AACCTAAGCA 
30421 CTGCCAAGGG GTTGATGTTT GACGCTACAG CCATAGCCAT TAATGCAGGA GATGGGCTTG 
30481 AATTTGGTTC ACCTAATGCA CCAAACACAA ATCCCCTCAA AACAAAAATT GGCCATGGCC 
30541 TAGAATTTGA TTCAAACAAG GCTATGGTTC CTAAACTAGG AACTGGCCTT AGTTTTGACA 
30601 GCACAGGTGC CATTACAGTA GGAAACAAAA ATAATGATAA GCTAACTTTG TGGACCACAC 
30661 CAGCTCCATC TCCTAACTGT AGACTAAATG CAGAGAAAGA TGCTAAACTC ACTTTGGTCT 
30721 TAACAAAATG TGGCAGTCAA ATACTTGCTA CAGTTTCAGT TTTGGCTGTT AAAGGCAGTT 
30781 TGGCTCCAAT ATCTGGAACA GTTCAAAGTG CTCATCTTAT TATAAGATTT GACGAAAATG 
30841 GAGTGCTACT AAACAATTCC TTCCTGGACC CAGAATATTG GAACTTTAGA AATGGAGATC 
30901 TTACTGAAGG CACAGCCTAT ACAAACGCTG TTGGATTTAT GCCTAACCTA TCAGCTTATC 
30961 CAAAATCTCA CGGTAAAACT GCCAAAAGTA ACATTGTCAG TCAAGTTTAC TTAAACGGAG 
31021 ACAAAACTAA ACCTGTAACA CTAACCATTA CACTAAACGG TACACAGGAA ACAGGAGACA 
31081 CAACTCCAAG TGCATACTCT ATGTCATTTT CATGGGACTG GTCTGGCCAC AACTACATTA 
31141 ATGAAATATT TGCCACATCC TCTTACACTT TTTCATACAT TGCCCAAGAA TAAAGAATCG 
31201 TTTGTGTTAT GTTTCAACGT GTTTATTTTT CAATTGCAGA AAATTTCAAG TCATTTTTCA 
31261 TTCAGTAGTA TAGCCCCACC ACCACATAGC TTATACAGAT CACCGTACCT TAATCAAACT 
31321 CACAGAACCC TAGTATTCAA CCTGCCACCT CCCTCCCAAC ACACAGAGTA CACAGTCCTT 
31381 TCTCCCCGGC TGGCCTTAAA AAGCATCATA TCATGGGTAA CAGACATATT CTTAGGTGTT 
31441 ATATTCCACA CGGTTTCCTG TCGAGCCAAA CGCTCATCAG TGATATTAAT AAACTCCCCG 
31501 GGCAGCTCAC TTAAGTTCAT GTCGCTGTCC AGCTGCTGAG CCACAGGCTG CTGTCCAACT 
31561 TGCGGTTGCT TAACGGGCGG CGAAGGAGAA GTCCACGCCT ACATGGGGGT AGAGTCATAA 
31621 TCGTGCATCA GGATAGGGCG GTGGTGCTGC AGCAGCGCGC GAATAAACTG CTGCCGCCGC 
31681 CGCTCCGTCC TGCAGGAATA CAACATGGCA GTGGTCTCCT CAGCGATGAT TCGCACCGCC 
31741 CGCAGCATAA GGCGCCTTGT CCTCCGGGCA CAGCAGCGCA CCCTGATCTC ACTTAAATCA 
31801 GCACAGTAAC TGCAGCACAG CACCACAATA TTGTTCAAAA TCCCACAGTG CAAGGCGCTG 
31861 TATCCAAAGC TCATGGCGGG GACCACAGAA CCCACGTGGC CATCATACCA CAAGCGCAGG 
31921 TAGATTAAGT GGCGACCCCT CATAAACACG CTGGACATAA ACATTACCTC TTTTGGCATG 
31981 TTGTAATTCA CCACCTCCCG GTACCATATA AACCTCTGAT TAAACATGGC GCCATCCACC 
32041 ACCATCCTAA ACCAGCTGGC CAAAACCTGC CCGCCGGCTA TACACTGCAG GGAACCGGGA 
32101 CTGGAACAAT GACAGTGGAG AGCCCAGGAC TCGTAACCAT GGATCATCAT GCTCGTCATG 
32161 ATATCAATGT TGGCACAACA CAGGCACACG TGCATACACT TCCTCAGGAT TACAAGCTCC 
32221 TCCCGCGTTA GAACCATATC CCAGGGAACA ACCCATTCCT GAATCAGCGT AAATCCCACA 
32281 CTGCAGGGAA GACCTCGCAC GTAACTCACG TTGTGCATTG TCAAAGTGTT ACATTCGGGC 
32341 AGCAGCGGAT GATCCTCCAG TATGGTAGCG CGGGTTTCTG TCTCAAAAGG AGGTAGACGA 
32401 TCCCTACTGT ACGGAGTGCG CCGAGACAAC CGAGATCGTG TTGGTCGTAG TGTCATGCCA 
32461 AATGGAACGC CGGACGTAGT CATATTTCCT GAAGCAAAAC CAGGTGCGGG CGTGACAAAC 
32521 AGATCTGCGT CTCCGGTCTC GCCGCTTAGA TCGCTCTGTG TAGTAGTTGT AGTATATCCA 
32581 CTCTCTCAAA GCATCCAGGC GCCCCCTGGC TTCGGGTTCT ATGTAAACTC CTTCATGCGC 
32641 CGCTGCCCTG ATAACATCCA CCACCGCAGA ATAAGCCACA CCCAGCCAAC CTACACATTC 
32701 GTTCTGCGAG TCACACACGG GAGGAGCGGG AAGAGCTGGA AGAACCATGT TTTTTTTTTT 
32761 ATTCCAAAAG ATTATCCAAA ACCTCAAAAT GAAGATCTAT TAAGTGAACG CGCTCCCCTC 
32821 CGGTGGCGTG GTCAAACTCT ACAGCCAAAG AACAGATAAT GGCATTTGTA AGATGTTGCA 
32881 CAATGGCTTC CAAAAGGCAA ACGGCCCTCA CGTCCAAGTG GACGTAAAGG CTAAACCCTT 
32941 CAGGGTGAAT CTCCTCTATA AACATTCCAG CACCTTCAAC CATGCCCAAA TAATTCTCAT 
33001 CTCGCCACCT TCTCAATATA TCTCTAAGCA AATCCCGAAT ATTAAGTCCG GCCATTGTAA 
33061 AAATCTGCTC CAGAGCGCCC TCCACCTTCA GCCTCAAGCA GCGAATCATG ATTGCAAAAA 
33121 TTCAGGTTCC TCACAGACCT GTATAAGATT CAAAAGCGGA ACATTAACAA AAATACCGCG 
33181 ATCCCGTAGG TCCCTTCGCA GGGCCAGCTG AACATAATCG TGCAGGTCTG CACGGACCAG 
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33241 CGCGGCCACT TCCCCGCCAG GAACCTTGAC AAAAGAACCC ACACTGATTA TGACACGCAT 
33301 ACTCGGAGCT ATGCTAACCA GCGTAGCCCC GATGTAAGCT TTGTTGCATG GGCGGCGATA 
33361 TAAAATGCAA GGTGCTGCTC AAAAAATCAG GCAAAGCCTC GCGCAAAAAA GAAAGCACAT 
33421 CGTAGTCATG CTCATGCAGA TAAAGGCAGG TAAGCTCCGG AACCACCACA GAAAAAGACA 
33481 CCA TTTTT CT CTCAAACATG TCTGCGGGTT TCTGCATAAA CACAAAATAA AATAACAAAA 
33541 AAACATTTAA ACATTAGAAG CCTGTCTTAC AACAGGAAAA ACAACCCTTA TAAGCATAAG 
33601 ACGGACTACG GCCATGCCGG CGTGACCGTA AAAAAACTGG TCACCGTGAT TAAAAAGCAC 
33661 CACCGACAGC TCCTCGGTCA TGTCCGGAGT CATAATGTAA GACTCGGTAA ACACATCAGG 
33721 TTGATTCATC GGTCAGTGCT AAAAAGCGAC CGAAATAGCC CGGGGGAATA CATACCCGCA 
33761 GGCGTAGAGA CAACATTACA GCCCCCATAG GAGGTATAAC AAAATTAATA GGAGAGAAAA 
33841 ACACATAAAC ACCTGAAAAA CCCTCCTGCC TAGGCAAAAT AGCACCCTCC CGCTCCAGAA 
33901 CAACATACAG CGCTTCACAG CGGCAGCCTA ACAGTCAGCC TTACCAGTAA AAAAGAAAAC 
33961 CTATTAAAAA AACACCACTC GACACGGCAC CAGCTCAATC AGTCACAGTG TAAAAAAGGG 
34021 CCAAGTGCAG AGCGAGTATA TATAGGACTA AAAAATGACG TAACGGTTAA AGTCCACAAA 
34081 AAACACCCAG AAAACCGCAC GCGAACCTAC GCCCAGAAAC GAAAGCCAAA AAACCCACAA 
34141 CTTCCTCAAA TCGTCACTTC CGTTTTCCCA CGTTACGTAA CTTCCCATTT TAAGAAAACT 
34201 ACAATTCCCA ACACATACAA GTTACTCCGC CCTAAAACCT ACGTCACCCG CCCCGTTCCC 
34261 ACGCCCCGCG CCACGTCACA AACTCCACCC CCTCATTATC ATATTGGCTT CAATCCAAAA 
34321 TAAGGTATAT TATTGATGAT G 
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E1A Functions Major Late Transcription Unit 
• Induce Ad genes. 

. Deregulate cell cycle. J* i£ \* ^Jt 

. Drive G, into S-phase. tA«W*C><><>«C><>«t> 
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SEQUENCE LISTING 

<110> Wold, William S.M. 
Toth, Karoly 
Doronin, Konstantin 
Tollefson, Ann E. 

<120> Replication-Competent Anti-Cancer Vectors 

<130> 16153-5152 

<140> 
<141> 

<150> 09/351,778 
<151> 1999-07-12 

<160> 20 

<170> Patentln Ver. 2.0 

<210> 1 
<211> 33592 
<212> DNA 

<213> Adenovirus subgroup C 
<400> 1 

catcatcaat aatatacctt attttggatt gaagccaata tgataatgag ggggtggagt 60 

ttgtgacgtg gcgcggggcg tgggaacggg gcgggtgacg tagtagtgtg gcggaagtgt 120 

gatgttgcaa gtgtggcgga acacatgtaa gcgacggatg tggcaaaagt gacgtttttg 180 

gtgtgcgccg gtgtacacag gaagtgacaa ttttcgcgcg gttttaggcg gatgttgtag 24 0 

taaatttggg cgtaaccgag taagatttgg ccattttcgc gggaaaactg aataagagga 300 

agtgaaatct gaataatttt gtgttactca tagcgcgtaa tatttgtcta gggccgcggg 360 

gactttgacc gtttacgtgg agactcgccc aggtgttttt ctcaggtgtt ttccgcgttc 420 

cgggtcaaag ttggcgtttt attattatag tcagctgacg tgtagtgtat ttatacccgg 480 

tgagttcctc aagaggccac tcttgagtgc cagcgagtag agttttctcc tccgagccgc 540 

tccgacaccg ggactgaaaa tgagacatga ggtactggct gataatcttc cacctcctag 600 

ccattttgaa ccacctaccc ttcacgaact gtatgattta gacgtgacgg cccccgaaga 660 

tcccaacgag gaggcggttt cgcagatttt tcccgactct gtaatgttgg cggtgcagga 720 

agggattgac ttactcactt ttccgccggc gcccggttct ccggagccgc ctcacctttc 780 

ccggcagccc gagcagccgg agcagagagc cttgggtccg gtttgccacg aggctggctt 84 0 

tccacccagt gacgacgagg atgaagaggg tgaggagttt gtgttagatt atgtggagca 900 

ccccgggcac ggttgcaggt cttgtcatta tcaccggagg aatacggggg acccagatat 960 

tatgtgttcg ctttgctata tgaggacctg tggcatgttt gtctacagta agtgaaaatt 1020 

atgggcagtg ggtgatagag tggtgggttt ggtgtggtaa tttttttttt aatttttaca 1080 

gttttgtggt ttaaagaatt ttgtattgtg atttttttaa aaggtcctgt gtctgaacct 1140 

gagcctgagc ccgagccaga accggagcct gcaagaccta cccgccgtcc taaaatggcg 1200 

cctgctatcc tgagacgccc gacatcacct gtgtctagag aatgcaatag tagtacggat 1260 

agctgtgact ccggtccttc taacacacct cctgagatac acccggtggt cccgctgtgc 1320 

cccattaaac cagttgccgt gagagttggt gggcgtcgcc aggctgtgga atgtatcgag 1380 

gacttgctta acgagcctgg gcaacctttg gacttgagct gtaaacgccc caggccataa 1440 

ggtgtaaacc tgtgattgcg tgtgtggtta acgcctttgt ttgctgaatg agttgatgta 1500 

agtttaataa agggtgagat aatgtttaac ttgcatggcg tgttaaatgg ggcggggctt 1560 

aaagggtata taatgcgccg tgggctaatc ttggttacat ctgacctcat ggaggcttgg 1620 

gagtgtttgg aagatttttc tgctgtgcgt aacttgctgg aacagagctc taacagtacc 1680 

tcttggtttt ggaggtttct gtggggctca tcccaggcaa agttagtctg cagaattaag 1740 

gaggattaca agtgggaatt tgaagagctt ttgaaatcct gtggtgagct gtttgattct 1800 

ttgaatctgg gtcaccaggc gcttttccaa gagaaggtca tcaagacttt ggatttttcc 1860 

acaccggggc gcgctgcggc tgctgttgct tttttgagtt ttataaagga taaatggagc 1920 

gaagaaaccc atctgagcgg ggggtacctg ctggattttc tggccatgca tctgtggaga 1980 

gcggttgtga gacacaagaa tcgcctgcta ctgttgtctt ccgtccgccc ggcgataata 204 0 

ccgacggagg agcagcagca gcagcaggag gaagccaggc ggcggcggca ggagcagagc 2100 

ccatggaacc cgagagccgg cctggaccct cgggaatgaa tgttgtacag gtggctgaac 2160 

tgtatccaga actgagacgc attttgacaa ttacagagga tgggcagggg ctaaaggggg 2220 

taaagaggga gcggggggct tgtgaggcta cagaggaggc taggaatcta gcttttagct 2280 

taatgaccag acaccgtcct gagtgtatta cttttcaaca gatcaaggat aattgcgcta 2340 
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atgagcttga tctgctggcg cagaagtatt 
agccagggga tgattttgag gaggctatta 
attgcaagta caagatcagc aaacttgtaa 
acggggccga ggtggagat'a gatacggagg 
atatgtggcc gggggtgctt ggcatggacg 
gccccaattt tagcggtacg gttttcctgg 
gcttctatgg gtttaacaat acctgtgtgg 
gtgcctttta ctgctgctgg aagggggtgg 
agaaatgcct ctttgaaagg tgtaccttgg 
gccacaatgt ggcctccgac tgtggttgct 
agcataacat ggtatgtggc aactgcgagg 
acggcaactg tcacctgctg aagaccattc 
cagtgtttga gcataacata ctgacccgct 
tgttcctacc ttaccaatgc aatttgagtc 
tgtccaaggt gaacctgaac ggggtgtttg 
ggtacgatga gacccgcacc aggtgcagac 
accagcctgt gatgctggat gtgaccgagg 
gcacccgcgc tgagtttggc tctagcgatg 
ggcgtggctt aagggtggga aagaatatat 
gttttgcagc agccgccgcc gccatgagca 
catatttgac aacgcgcatg cccccatggg 
gcattgatgg tcgccccgtc ctgcccgcaa 
ctggaacgcc gttggagact gcagcctccg 
gcgggattgt gactgacttt gctttcctga 
catccgcccg cgatgacaag ttgacggctc 
aacttaatgt cgtttctcag cagctgttgg 
cttcctcccc tcccaatgcg gtttaaaaca 
ggatcaagca agtgtcttgc tgtctttatt 
accagcggtc tcggtcgttg agggtcctgt 
tctggatgtt cagatacatg ggcataagcc 
gagcttcatg ctgcggggtg gtgttgtaga 
ggtgcctaaa aatgtctttc agtagcaagc 
tgtttacaaa gcggttaagc tgggatgggt 
actgtatttt taggttggct atgttcccag 
gaaccaccag cacagtgtat ccggtgcact 
atgcgtggaa gaacttggag acgcccttgt 
taatgatggc aatgggccca cgggcggcgg 
cgtcatagtt gtgttccagg atgagatcgt 
gggtgccaga ctgcggtata atggttccat 
tttgcatttc ccacgctttg agttcagatg 
agaaaacggt ttccggggta ggggagatca 
gcgacttacc gcagccggtg ggcccgtaaa 
taagagagct gcagctgccg tcatccctga 
tgactcgcat gttttccctg accaaatccg 
gttcttgcaa ggaagcaaag tttttcaacg 
tgagcgtttg accaagcagt tccaggcggt 
ctcgatccag catatctcct cgtttcgcgg 
tcggtgctcg tccagacggg ccagggtcat 
cgtagtctgg gtcacggtga aggggtgcgc 
gaggctggtc ctgctggtgc tgaagcgctg 
gcatttgacc atggtgtcat agtccagccc 
gcccttggag gaggcgccgc acgaggggca 
cgcgagaaat accgattccg gggagtaggc 
gcattccacg agccaggtga gctctggccg 
ctttttgatg cgtttcttac ctctggtttc 
aaggctgtcc gtgtccccgt atacagactt 
gtcctcctcg tatagaaact cggaccactc 
gaaggaggct aagtgggagg ggtagcggtc 
ggtgtgaaga cacatgtcgc cctcttcggc 
ggccacgtga ccgggtgttc ctgaaggggg 
ctcactctct tccgcatcgc tgtctgcgag 
aaaagcgggc atgacttctg cgctaagatt 
attcacctgg cccgcggtga tgcctttgag 
aatctttttg ttgtcaagct tggtggcaaa 
ggcgatggag cgcagggttt ggtttttgtc 
tagctgcacg tattcgcgcg caacgcaccg 



ccatagagca gctgaccact tactggctgc 2400 
gggtatatgc aaaggtggca cttaggccag 2460 
atatcaggaa ttgttgctac atttctggga 2520 
atagggtggc ctttagatgt agcatgataa 2580 
gggtggttat tatgaatgta aggtttactg 2640 
ccaataccaa ccttatccta cacggtgtaa 2700 
aagcctggac cgatgtaagg gttcggggct 27 60 
tgtgtcgccc caaaagcagg gcttcaatta 2820 
gtatcctgtc tgagggtaac tccagggtgc 2880 
tcatgctagt gaaaagcgtg gctgtgatta 2940 
acagggcctc tcagatgctg acctgctcgg 3000 
acgtagccag ccactctcgc aaggcctggc 3060 
gttccttgca tttgggtaac aggagggggg 3120 
acactaagat attgcttgag cccgagagca 3180 
acatgaccat gaagatctgg aaggtgctga 324 0 
cctgcgagtg tggcggtaaa catattagga 3300 
agctgaggcc cgatcacttg gtgctggcct 3360 
aagatacaga ttgaggtact gaaatgtgtg 3420 
aaggtggggg tcttatgtag ttttgtatct 3480 
ccaactcgtt tgatggaagc attgtgagct 3540 
ccggggtgcg tcagaatgtg atgggctcca 3600 
actctactac cttgacctac gagaccgtgt 3660 
ccgccgcttc agccgctgca gccaccgccc 3720 
gcccgcttgc aagcagtgca gcttcccgtt 3780 
ttttggcaca attggattct ttgacccggg 384 0 
atctgcgcca gcaggtttct gccctgaagg 3900 
taaataaaaa accagactct gtttggattt 3960 
taggggtttt gcgcgcgcgg taggcccggg 4 020 
gtattttttc caggacgtgg taaaggtgac 4 080 
cgtctctggg gtggaggtag caccactgca 4140 
tgatccagtc gtagcaggag cgctgggcgt 4200 
tgattgccag gggcaggccc ttggtgtaag 4260 
gcatacgtgg ggatatgaga tgcatcttgg 4320 
ccatatccct ccggggattc atgttgtgca 4380 
tgggaaattt gtcatgtagc ttagaaggaa 4440 
gacctccaag attttccatg cattcgtcca 4500 
cctgggcgaa gatatttctg ggatcactaa 4560 
cataggccat ttttacaaag cgcgggcgga 4 620 
ccggcccagg ggcgtagtta ccctcacaga 4 680 
gggggatcat gtctacctgc ggggcgatga 4740 
gctgggaaga aagcaggttc ctgagcagct 4800 
tcacacctat taccgggtgc aactggtagt 4860 
gcaggggggc cacttcgtta agcatgtccc 4920 
ccagaaggcg ctcgccgccc agcgatagca 4 980 
gtttgagacc gtccgccgta ggcatgcttt 504 0 
cccacagctc ggtcacctgc tctacggcat 5100 
gttggggcgg ctttcgctgt acggcagtag 5160 
gtctttccac gggcgcaggg tcctcgtcag 5220 
tccgggctgc gcgctggcca gggtgcgctt 5280 
ccggtcttcg ccctgcgcgt cggccaggta 5340 
ctccgcggcg tggcccttgg cgcgcagctt 5400 
gtgcagactt ttgagggcgt agagcttggg 54 60 
atccgcgccg caggccccgc agacggtctc 5520 
ttcggggtca aaaaccaggt ttcccccatg 5580 
catgagccgg tgtccacgct cggtgacgaa 5640 
gagaggcctg tcctcgagcg gtgttccgcg 5700 
tgagacaaag gctcgcgtcc aggccagcac 5760 
gttgtccact agggggtcca ctcgctccag 5820 
atcaaggaag gtgattggtt tgtaggtgta 5880 
gctataaaag ggggtggggg cgcgttcgtc 594 0 
ggccagctgt tggggtgagt actccctctg 6000 
gtcagtttcc aaaaacgagg aggatttgat 6060 
ggtggccgca tccatctggt cagaaaagac 6120 
cgacccgtag agggcgttgg acagcaactt 6180 
gcgatcggcg cgctccttgg ccgcgatgtt 6240 
ccattcggga aagacggtgg tgcgctcgtc 6300 
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gggcaccagg tgcacgcgcc aaccgcggtt 
tacctctccg cgtaggcgct cgttggtcca 
tggcggtagg gggtctagct gcgtctcgtc 
gggcagcagg cgcgcgtcga agtagtctat 
ccatgcgcgg gcggcaagcg cgcgctcgta 
gtgggtgagc gcggaggcgt acatgccgca 
tattccaaga tatgtagggt agcatcttcc 
tagttcgtgc gagggagcga ggaggtcggg 
tcggaagact atctgcctga agatggcatg 
gacgttgaag ctggcgtctg tgagacctac 
gcgcagcttg ttgaccagct cggcggtgac 
ttccttgatg atgtcatact tatcctgtcc 
aaactcttcg cggtctttcc agtactcttg 
agagcctagc atgtagaact ggttgacggc 
tagcgcgtat gcctgcgcgg ccttccggag 
gaccatgact ttgaggtact ggtatttgaa 
gagcaaaaag tccgtgcgct ttttggaacg 
gaagagtatc tttcccgcgc gaggcataaa 
ctcggaacgg ttgttaatta cctgggcggc 
gtggcccaca atgtaaagtt ccaagaagcg 
aagttcctcg taggtgagct cttcagggga 
tgcaagatga gggttggaag cgacgaatga 
caggtggtcg cgaaaggtcc taaactggcg 
gtagaaggta agcgggtctt gttcccagcg 
cgcggcagtc actagaggct catctccgcc 
ctgcttccca aaggccccca tccaagtata 
ctcggtgcga ggatgcgagc cgatcgggaa 
gtggctattg atgtggtgaa agtagaagtc 
tttgtaaaaa cgtgcgcagt actggcagcg 
gacctgacga ccgcgcacaa ggaagcagag 
tggctggtgg tcttctactt cggctgcttg 
tacggtggat cggaccacca cgccgcgcga 
tcggagcttg atgacaacat cgcgcagatg 
cgtcaggtca ggcgggagct cctgcaggtt 
tagatccagg tgatacctaa tttccagggg 
gaggccgcat ccccgcggcg cgactacggt 
gtccttggat gatgcatcta aaagcggtga 
tccggacccg ccgggagagg gggcaggggc 
tgctgcgcgc gtaggttgct ggcgaacgcg 
cgcctctgcg tgaagacgac gggcccggtg 
tcaatttcgg tgtcgttgac ggcggcctgg 
tcttgatagg cgatctcggc catgaactgc 
ccggctcgct ccacggtggc ggcgaggtcg 
gcgttgaggc ctccctcgtt ccagacgcgg 
gcgcgcatga ccacctgcgc gagattgagc 
cgcaggcgct gaaagaggta gttgagggtg 
ataacccagc gtcgcaacgt ggattcgttg 
gcctcgtaga agtccacggc gaagttgaaa 
tcctcctcca gaagacggat gagctcggcg 
acaggggcct cttcttcttc ttcaatctcc 
tctggcggcg gtgggggagg ggggacacgg 
acaaagcgct cgatcatctc cccgcggcga 
ttctcgcggg ggcgcagttg gaagacgccg 
gggctgccat gcggcaggga tacggcgcta 
actccgccgc cgagggacct gagcgagtcc 
aaggcgtcta accagtcaca gtcgcaaggt 
cggcggtcgg ggttgtttct ggcggaggtg 
ttgagacggc ggatggtcga cagaagcacc 
aggcggtcgg ccatgcccca ggcttcgttt 
tgcatgagcc tttctaccgg cacttcttct 
tctatcgctg cggcggcggc ggagtttggc 
gtgaccccga agcccctcat cggctgaagc 
aatatggcct gctgcacctg cgtgagggta 
tggtatgcgc ccgtgttgat ggtgtaagtg 
tggtgacccg gctgcgagag ctcggtgtac 
acgtagtcgt tgcaagtccg caccaggtac 



gtgcagggtg acaaggtcaa cgctggtggc 6360 
gcagaggcgg ccgcccttgc gcgagcagaa 64 20 
cggggggtct gcgtccacgg taaagacccc 6480 
cttgcatcct tgcaagtcta gcgcctgctg 6540 
tgggttgagt gggggacccc atggcatggg 6600 
aatgtcgtaa acgtagaggg gctctctgag 6660 
accgcggatg ctggcgcgca cgtaatcgta 6720 
accgaggttg ctacgggcgg gctgctctgc 6780 
tgagttggat gatatggttg gacgctggaa 6840 
cgcgtcacgc acgaaggagg cgtaggagtc 6900 
ctgcacgtct agggcgcagt agtccagggt 6960 
cttttttttc cacagctcgc ggttgaggac 7020 
gatcggaaac ccgtcggcct ccgaacggta 7080 
ctggtaggcg cagcatccct tttctacggg 7140 
cgaggtgtgg gtgagcgcaa aggtgtccct 7200 
gtcagtgtcg tcgcatccgc cctgctccca 7260 
cggatttggc agggcgaagg tgacatcgtt 7320 
gttgcgtgtg atgcggaagg gtcccggcac 7380 
gagcacgatc tcgtcaaagc cgttgatgtt 7440 
cgggatgccc ttgatggaag gcaatttttt 7500 
gctgagcccg tgctctgaaa gggcccagtc 7560 
gctccacagg tcacgggcca ttagcatttg 7 620 
acctatggcc attttttctg gggtgatgca 7 680 
gtcccatcca aggttcgcgg ctaggtctcg 7740 
gaacttcatg accagcatga agggcacgag 7800 
ggtctctaca tcgtaggtga caaagagacg 7860 
gaactggatc tcccgccacc aattggagga 7920 
cctgcgacgg gccgaacact cgtgctggct 7980 
gtgcacgggc tgtacatcct gcacgaggtt 8040 
tgggaatttg agcccctcgc ctggcgggtt 8100 
tccttgaccg tctggctgct cgaggggagt 8160 
gcccaaagtc cagatgtccg cgcgcggcgg 8220 
ggagctgtcc atggtctgga gctcccgcgg 8280 
tacctcgcat agacgggtca gggcgcgggc 8340 
ctggttggtg gcggcgtcga tggcttgcaa 8400 
accgcgcggc gggcggtggg ccgcgggggt 84 60 
cgcgggcgag cccccggagg tagggggggc 8520 
acgtcggcgc cgcgcgcggg caggagctgg 8580 
acgacgcggc ggttgatctc ctgaatctgg 8640 
agcttgagcc tgaaagagag ttcgacagaa 8700 
cgcaaaatct cctgcacgtc tcctgagttg 8760 
tcgatctctt cctcctggag atctccgcgt 8820 
ttggaaatgc gggccatgag ctgcgagaag 8880 
ctgtagacca cgcccccttc ggcatcgcgg 8940 
tccacgtgcc gggcgaagac ggcgtagttt 9000 
gtggcggtgt gttctgccac gaagaagtac 9060 
atatccccca aggcctcaag gcgctccatg 9120 
aactgggagt tgcgcgccga cacggttaac 9180 
acagtgtcgc gcacctcgcg ctcaaaggct 9240 
tcttccataa gggcctcccc ttcttcttct 9300 
cggcgacgac ggcgcaccgg gaggcggtcg 9360 
cggcgcatgg tctcggtgac ggcgcggccg 9420 
cccgtcatgt cccggttatg ggttggcggg 94 80 
acgatgcatc tcaacaattg ttgtgtaggt 9540 
gcatcgaccg gatcggaaaa cctctcgaga 9600 
aggctgagca ccgtggcggg cggcagcggg 9660 
ctgctgatga tgtaattaaa gtaggcggtc 9720 
atgtccttgg gtccggcctg ctgaatgcgc 9780 
tgacatcggc gcaggtcttt gtagtagtct 9840 
tctccttcct cttgtcctgc atctcttgca 9900 
cgtaggtggc gccctcttcc tcccatgcgt 9960 
agggctaggt cggcgacaac gcgctcggct 10020 
gactggaagt catccatgtc cacaaagcgg 10080 
cagttggcca taacggacca gttaacggtc 10140 
ctgagacgcg agtaagccct cgagtcaaat 10200 
tggtatccca ccaaaaagtg cggcggcggc 10260 
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tggcggtaga ggggccagcg tagggtggcc ggggctccgg gggcgagatc ttccaacata 10320 
aggcgatgat atccgtagat gtacctggac atccaggtga tgccggcggc ggtggtggag 10380 
gcgcgcggaa agtcgcggac gcggttccag atgttgcgca gcggcaaaaa gtgctccatg 10440 
gtcgggacgc tctggccggt caggcgcgcg caatcgttga cgctctagcg tgcaaaagga 10500 
gagcctgtaa gcgggcactc ttccgtggtc tggtggataa attcgcaagg gtatcatggc 10560 
ggacgaccgg ggttcgagcc ccgtatccgg ccgtccgccg tgatccatgc ggttaccgcc 10620 
cgcgtgtcga acccaggtgt gcgacgtcag acaacggggg agtgctcctt ttggcttcct 10680 
tccaggcgcg gcggctgctg cgctagcttt tttggccact ggccgcgcgc agcgtaagcg 10740 
gttaggctgg aaagcgaaag cattaagtgg ctcgctccct gtagccggag ggttattttc 10800 
caagggttga gtcgcgggac ccccggttcg agtctcggac cggccggact gcggcgaacg 10860 
ggggtttgcc tccccgtcat gcaagacccc gcttgcaaat tcctccggaa acagggacga 10920 
gccccttttt tgcttttccc agatgcatcc ggtgctgcgg cagatgcgcc cccctcctca 10980 
gcagcggcaa gagcaagagc agcggcagac atgcagggca ccctcccctc ctcctaccgc 11040 
gtcaggaggg gcgacatccg cggttgacgc ggcagcagat ggtgattacg aacccccgcg 11100 
gcgccgggcc cggcactacc tggacttgga ggagggcgag ggcctggcgc ggctaggagc 11160 
gccctctcct gagcggtacc caagggtgca gctgaagcgt gatacgcgtg aggcgtacgt 11220 
gccgcggcag aacctgtttc gcgaccgcga gggagaggag cccgaggaga tgcgggatcg 11280 
aaagttccac gcagggcgcg agctgcggca tggcctgaat cgcgagcggt tgctgcgcga 11340 
ggaggacttt gagcccgacg cgcgaaccgg gattagtccc gcgcgcgcac acgtggcggc 11400 
cgccgacctg gtaaccgcat acgagcagac ggtgaaccag gagattaact ttcaaaaaag 11460 
ctttaacaac cacgtgcgta cgcttgtggc gcgcgaggag gtggctatag gactgatgca 11520 
tctgtgggac tttgtaagcg cgctggagca aaacccaaat agcaagccgc tcatggcgca 11580 
gctgttcctt atagtgcagc acagcaggga caacgaggca ttcagggatg cgctgctaaa 11640 
catagtagag cccgagggcc gctggctgct cgatttgata aacatcctgc agagcatagt 11700 
ggtgcaggag cgcagcttga gcctggctga caaggtggcc gccatcaact attccatgct 11760 
tagcctgggc aagttttacg cccgcaagat ataccatacc ccttacgttc ccatagacaa 11820 
ggaggtaaag atcgaggggt tctacatgcg catggcgctg aaggtgctta ccttgagcga 11880 
cgacctgggc gtttatcgca acgagcgcat ccacaaggcc gtgagcgtga gccggcggcg 11940 
cgagctcagc gaccgcgagc tgatgcacag cctgcaaagg gccctggctg gcacgggcag 12000 
cggcgataga gaggccgagt cctactttga cgcgggcgct gacctgcgct gggccccaag 12060 
ccgacgcgcc ctggaggcag ctggggccgg acctgggctg gcggtggcac ccgcgcgcgc 12120 
tggcaacgtc ggcggcgtgg aggaatatga cgaggacgat gagtacgagc cagaggacgg 12180 
cgagtactaa gcggtgatgt ttctgatcag atgatgcaag acgcaacgga cccggcggtg 12240 
cgggcggcgc tgcagagcca gccgtccggc cttaactcca cggacgactg gcgccaggtc 12300 
atggaccgca tcatgtcgct gactgcgcgc aatcctgacg cgttccggca gcagccgcag 12360 
gccaaccggc tctccgcaat tccggaagcg gtggtcccgg cgcgcgcaaa ccccacgcac 12420 
gagaaggtgc tggcgatcgt aaacgcgctg gccgaaaaca gggccatccg gcccgacgag 124 80 
gccggcctgg tctacgacgc gctgcttcag cgcgtggctc gttacaacag cggcaacgtg 12540 
cagaccaacc tggaccggct ggtgggggat gtgcgcgagg ccgtggcgca gcgtgagcgc 12 600 
gcgcagcagc agggcaacct gggctccatg gttgcactaa acgccttcct gagtacacag 12660 
cccgccaacg tgccgcgggg acaggaggac tacaccaact ttgtgagcgc actgcggcta 12720 
atggtgactg agacaccgca aagtgaggtg taccagtctg ggccagacta ttttttccag 12780 
accagtagac aaggcctgca gaccgtaaac ctgagccagg ctttcaaaaa cttgcagggg 12840 
ctgtgggggg tgcgggctcc cacaggcgac cgcgcgaccg tgtctagctt gctgacgccc 12900 
aactcgcgcc tgttgctgct gctaatagcg cccttcacgg acagtggcag cgtgtcccgg 12960 
gacacatacc taggtcactt gctgacactg taccgcgagg ccataggtca ggcgcatgtg 13020 
gacgagcata ctttccagga gattacaagt gtcagccgcg cgctggggca ggaggacacg 13080 
ggcagcctgg aggcaaccct aaactacctg ctgaccaacc ggcggcagaa gatcccctcg 13140 
ttgcacagtt taaacagcga ggaggagcgc attttgcgct acgtgcagca gagcgtgagc 13200 
cttaacctga tgcgcgacgg ggtaacgccc agcgtggcgc tggacatgac cgcgcgcaac 13260 
atggaaccgg gcatgtatgc ctcaaaccgg ccgtttatca accgcctaat ggactacttg 13320 
catcgcgcgg ccgccgtgaa ccccgagtat ttcaccaatg ccatcttgaa cccgcactgg 13380 
ctaccgcccc ctggtttcta caccggggga ttcgaggtgc ccgagggtaa cgatggattc 13440 
ctctgggacg acatagacga cagcgtgtct tccccgcaac cgcagaccct gctagagttg 13500 
caacagcgcg agcaggcaga ggcggcgctg cgaaaggaaa gcttccgcag gccaagcagc 13560 
ttgtccgatc taggcgctgc ggccccgcgg tcagatgcta gtagcccatt tccaagcttg 13620 
atagggtctc ttaccagcac tcgcaccacc cgcccgcgcc tgctgggcga ggaggagtac 13680 
ctaaacaact cgctgctgca gccgcagcgc gaaaaaaacc tgcctccggc atttcccaac 13740 
aacgggatag agagcctagt ggacaagatg agtagatgga agacgtacgc gcaggagcac 13800 
agggacgtgc caggcccgcg cccgcccacc cgtcgtcaaa ggcacgaccg tcagcggggt 13860 
ctggtgtggg aggacgatga ctcggcagac gacagcagcg tcctggattt gggagggagt 13920 
ggcaacccgt ttgcgcacct tcgccccagg ctggggagaa tgttttaaaa aaaaaaaagc 13980 
atgatgcaaa ataaaaaact caccaaggcc atggcaccga gcgttggttt tcttgtattc 14040 
cccttagtat gcggcgcgcg gcgatgtatg aggaaggtcc tcctccctcc tacgagagtg 14100 
tggtgagcgc ggcgccagtg gcggcggcgc tgggttctcc cttcgatgct cccctggacc 14160 
cgccgtttgt gcctccgcgg tacctgcggc ctaccggggg gagaaacagc atccgttact 14220 
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ctgagttgcc acccctattc gacaccaccc 
atgtggcatc cctgaactac cagaacgacc 
acaatgacta cagcccgggg gaggcaagca 
actggggcgc cgacctgaaa accatcctgc 
tgtttaccaa taagtttaag gcgcgggtga 
aggtggagct gaaatacgag tgggtggagt 
ccatgaccat agaccttatg aacaacgcga 
agaacggggt tctggaaagc gacatcgggg 
ggtttgaccc cgtcactggt cttgtcatgc 
cagacatcat tttgctgcca ggatgcgggg 
tgttgggcat ccgcaagcgg caacccttcc 
tggagggtgg taacattccc gcactgttgg 
atgacaccga acagggcggg ggtggcgcag 
aagagaactc caacgcggca gccgcggcaa 
ccattcgcgg cgacaccttt gccacacggg 
cggccgaagc tgccgccccc gctgcgcaac 
tgatcaaacc cctgacagag gacagcaaga 
gcaccttcac ccagtaccgc agctggtacc 
gaatccgctc atggaccctg ctttgcactc 
actggtcgtt gccagacatg atgcaagacc 
gcaactttcc ggtggtgggc gccgagctgt 
accaggccgt ctactcccaa ctcatccgcc 
gctttcccga gaaccagatt ttggcgcgcc 
aaaacgttcc tgctctcaca gatcacggga 
tccagcgagt gaccattact gacgccagac 
tgggcatagt ctcgccgcgc gtcctatcga 
ttatatcgcc cagcaataac acaggctggg 
gggccaagaa gcgctccgac caacacccag 
ggggcgcgca caaacgcggc cgcactgggc 
tggtggagga ggcgcgcaac tacacgccca 
ccattcagac cgtggtgcgc ggagcccggc 
gcgtagcacg tcgccaccgc cgccgacccg 
tgcttaaccg cgcacgtcgc accggccgac 
ccgcgggtat tgtcactgtg ccccccaggt 
cggccattag tgctatgact cagggtcgca 
ttagcggcct gcgcgtgccc gtgcgcaccc 
actacttaga ctcgtactgt tgtatgtatc 
ccaagcgcaa aatcaaagaa gagatgctcc 
cgaagaagga agagcaggat tacaagcccc 
aagatgatga tgatgaactt gacgacgagg 
gacgggtaca gtggaaaggt cgacgcgtaa 
tctttacgcc cggtgagcgc tccacccgca 
gcgacgagga cctgcttgag caggccaacg 
ggcataagga catgctggcg ttgccgctgg 
ccgtaacact gcagcaggtg ctgcccgcgc 
agcgcgagtc tggtgacttg gcacccaccg 
tggaagatgt cttggaaaaa atgaccgtgg 
ggccaatcaa gcaggtggcg ccgggactgg 
ctaccagtag caccagtatt gccaccgcca 
ttgcctcagc ggtggcggat gccgcggtgc 
ctacggaggt gcaaacggac ccgtggatgt 
gttcgaggaa gtacggcgcc gccagcgcgc 
ttgcgcctac ccccggctat cgtggctaca 
gacgccgaac caccactgga acccgccgcc 
cgatttccgt gcgcagggtg gctcgcgaag 
gctaccaccc cagcatcgtt taaaagccgg 
cctgccgcct ccgtttcccg gtgccgggat 
tggccggcca cggcctgacg ggcggcatgc 
cgcaccgtcg catgcgcggc ggtatcctgc 
ttggcgccgt gcccggaatt gcatccgtgg 
caagttgcat gtggaaaaat caaaataaaa 
taactatttt gtagaatgga agacatcaac 
cgcccgttca tgggaaactg gcaagatatc 
agctggggct cgctgtggag cggcattaaa 
agcaaggcct ggaacagcag cacaggccag 
ttccaacaaa aggtggtaga tggcctggcc 



gtgtgtacct ggtggacaac aagtcaacgg 14280 

acagcaactt tctgaccacg gtcattcaaa 14340 

cacagaccat caatcttgac gaccggtcgc 14400 

ataccaacat gccaaatgtg aacgagttca 14 460 

tggtgtcgcg cttgcctact aaggacaatc 14520 

tcacgctgcc cgagggcaac tactccgaga 14580 

tcgtggagca ctacttgaaa gtgggcagac 14640 

taaagtttga cacccgcaac ttcagactgg 14700 

ctggggtata tacaaacgaa gccttccatc 14760 

tggacttcac ccacagccgc ctgagcaact 14820 

aggagggctt taggatcacc tacgatgatc 14880 

atgtggacgc ctaccaggcg agcttgaaag 14940 

gcggcagcaa cagcagtggc agcggcgcgg 15000 

tgcagccggt ggaggacatg aacgatcatg 150 60 

ctgaggagaa gcgcgctgag gccgaagcag 15120 

ccgaggtcga gaagcctcag aagaaaccgg 15180 

aacgcagtta caacctaata agcaatgaca 15240 

ttgcatacaa ctacggcgac cctcagaccg 15300 

ctgacgtaac ctgcggctcg gagcaggtct 15360 

ccgtgacctt ccgctccacg cgccagatca 15420 

tgcccgtgca ctccaagagc ttctacaacg 15480 

agtttacctc tctgacccac gtgttcaatc 15540 

cgccagcccc caccatcacc accgtcagtg 15600 

cgctaccgct gcgcaacagc atcggaggag 15660 

gccgcacctg cccctacgtt tacaaggccc 15720 

gccgcacttt ttgagcaagc atgtccatcc 15780 

gcctgcgctt cccaagcaag atgtttggcg 15840 

tgcgcgtgcg cgggcactac cgcgcgccct 15900 

gcaccaccgt cgatgacgcc atcgacgcgg 15960 

cgccgccacc agtgtccaca gtggacgcgg 16020 

gctatgctaa aatgaagaga cggcggaggc 16080 

gcactgccgc ccaacgcgcg gcggcggccc 16140 

gggcggccat gcgggccgct cgaaggctgg 16200 

ccaggcgacg agcggccgcc gcagcagccg 16260 

ggggcaacgt gtattgggtg cgcgactcgg 16320 

gccccccgcg caactagatt gcaagaaaaa 16380 

cagcggcggc ggcgcgcaac gaagctatgt 164 40 

aggtcatcgc gccggagatc tatggccccc 16500 

gaaagctaaa gcgggtcaaa aagaaaaaga 16560 

tggaactgct gcacgctacc gcgcccaggc 16620 

aacgtgtttt gcgacccggc accaccgtag 16680 

cctacaagcg cgtgtatgat gaggtgtacg 16740 

agcgcctcgg ggagtttgcc tacggaaagc 16800 

acgagggcaa cccaacacct agcctaaagc 16860 

ttgcaccgtc cgaagaaaag cgcggcctaa 16920 

tgcagctgat ggtacccaag cgccagcgac 16980 

aacctgggct ggagcccgag gtccgcgtgc 17040 

gcgtgcagac cgtggacgtt cagataccca 17100 

cagagggcat ggagacacaa acgtccccgg 17160 

aggcggtcgc tgcggccgcg tccaagacct 17220 

ttcgcgtttc agccccccgg cgcccgcgcg 17280 

tactgcccga atatgcccta catccttcca 17340 

cctaccgccc cagaagacga gcaactaccc 17400 

gccgtcgccg tcgccagccc gtgctggccc 17460 

gaggcaggac cctggtgctg ccaacagcgc 17520 

tctttgtggt tcttgcagat atggccctca 17580 

tccgaggaag aatgcaccgt aggaggggca 17640 

gtcgtgcgca ccaccggcgg cggcgcgcgt 17700 

ccctccttat tccactgatc gccgcggcga 17760 

ccttgcaggc gcagagacac tgattaaaaa 17820 

agtctggact ctcacgctcg cttggtcctg 17880 

tttgcgtctc tggccccgcg acacggctcg 17940 

ggcaccagca atatgagcgg tggcgccttc 18000 

aatttcggtt ccaccgttaa gaactatggc 18060 

atgctgaggg ataagttgaa agagcaaaat 18120 

tctggcatta gcggggtggt ggacctggcc 18180 
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aaccaggcag tgcaaaataa gattaacagt 
cctccaccgg ccgtggagac agtgtctcca 
gacagggaag aaactctggt gacgcaaata 
aagcaaggcc tgcccaccac ccgtcccatc 
cacacacccg taacgctgga cctgcctccc 
ccaggcccga ccgccgttgt tgtaacccgt 
agcggtccgc gatcgttgcg gcccgtagcc 
atcgtgggtc tgggggtgca atccctgaag 
tcgtatgtgt gtcatgtatg cgtccatgtc 
gcccgctttc caagatggct accccttcga 
cgggccagga cgcctcggag tacctgagcc 
agacgtactt cagcctgaat aacaagttta 
tgaccacaga ccggtcccag cgtttgacgc 
ctgcgtactc gtacaaggcg cggttcaccc 
tggcttccac gtactttgac atccgcggcg 
actctggcac tgcctacaac gccctggctc 
atgaagctgc tactgctctt gaaataaacc 
aagtagacga gcaagctgag cagcaaaaaa 
gtataaatat tacaaaggag ggtattcaaa 
ccgataaaac atttcaacct gaacctcaaa 
ttaatcatgc agctgggaga gtccttaaaa 
catatgcaaa acccacaaat gaaaatggag 
gaaagctaga aagtcaagtg gaaatgcaat 
atggtgataa cttgactcct aaagtggtat 
cagacactca tatttcttac atgcccacta 
gccaacaatc tatgcccaac aggcctaatt 
taatgtatta caacagcacg ggtaatatgg 
atgctgttgt agatttgcaa gacagaaaca 
ccattggtga tagaaccagg tacttttcta 
cagatgttag aattattgaa aatcatggaa 
cactgggagg tgtgattaat acagagactc 
aaaatggatg ggaaaaagat gctacagaat 
ataattttgc catggaaatc aatctaaatg 
acatagcgct gtatttgccc gacaagctaa 
ataacccaaa cacctacgac tacatgaaca 
gctacattaa ccttggagca cgctggtccc 
accaccaccg caatgctggc ctgcgctacc 
tgcccttcca catccaggtg cctcagaagt 
cgggctcata cacctacgag tggaacttca 
ccctaggaaa tgacctaagg gttgacggag 
acgccacctt cttccccatg gcccacaaca 
acgacaccaa cgaccagtcc tttaacgact 
tacccgccaa cgctaccaac gtgcccatat 
gcggctgggc cttcacgcgc cttaagacta 
acccttatta cacctactct ggctctatac 
acacctttaa gaaggtggcc attacctttg 
gcctgcttac ccccaacgag tttgaaatta 
ttgcccagtg taacatgacc aaagactggt 
ttggctacca gggcttctat atcccagaga 
gaaacttcca gcccatgagc cgtcaggtgg 
aggtgggcat cctacaccaa cacaacaact 
tgcgcgaagg acaggcctac cctgctaact 
ttgacagcat tacccagaaa aagtttcttt 
ccagtaactt tatgtccatg ggcgcactca 
actccgccca cgcgctagac atgacttttg 
tttatgtttt gtttgaagtc tttgacgtgg 
tcgaaaccgt gtacctgcgc acgcccttct 
agcaacatca acaacagctg ccgccatggg 
caaagatctt ggttgtgggc catatttttt 
tgtttctcca cacaagctcg cctgcgccat 
cgtacactgg atggcctttg cctggaaccc 
ctttggcttt tctgaccagc gactcaagca 
gcgccgtagc gccattgctt cttcccccga 
aagcgtacag gggcccaact cggccgcctg 
ctttgccaac tggccccaaa ctcccatgga 
ggtacccaac tccatgctca acagtcccca 



aagcttgatc cccgccctcc cgtagaggag 18240 
gaggggcgtg gcgaaaagcg tccgcgcccc 18300 
gacgagcctc cctcgtacga ggaggcacta 18360 
gcgcccatgg ctaccggagt gctgggccag 18420 
cccgccgaca cccagcagaa acctgtgctg 18480 
cctagccgcg cgtccctgcg ccgcgccgcc 18540 
agtggcaact ggcaaagcac actgaacagc 18600 
cgccgacgat gcttctgaat agctaacgtg 18660 
gccgccagag gagctgctga gccgccgcgc 18720 
tgatgccgca gtggtcttac atgcacatct 18780 
ccgggctggt gcagtttgcc cgcgccaccg 18840 
gaaaccccac ggtggcgcct acgcacgacg 18900 
tgcggttcat ccctgtggac cgtgaggata 18960 
tagctgtggg tgataaccgt gtgctggaca 19020 
tgctggacag gggccctact tttaagccct 19080 
ccaagggtgc cccaaatcct tgcgaatggg 19140 
tagaagaaga ggacgatgac aacgaagacg 19200 
ctcacgtatt tgggcaggcg ccttattctg 19260 
taggtgtcga aggtcaaaca cctaaatatg 19320 
taggagaatc tcagtggtac gaaactgaaa 19380 
agactacccc aatgaaacca tgttacggtt 19440 
ggcaaggcat tcttgtaaag caacaaaatg 19500 
ttttctcaac tactgaggcg accgcaggca 19560 
tgtacagtga agatgtagat atagaaaccc 19620 
ttaaggaagg taactcacga gaactaatgg 19680 
acattgcttt tagggacaat tttattggtc 19740 
gtgttctggc gggccaagca tcgcagttga 19800 
cagagctttc ataccagctt ttgcttgatt 19860 
tgtggaatca ggctgttgac agctatgatc 19920 
ctgaagatga acttccaaat tactgctttc 19980 
ttaccaaggt aaaacctaaa acaggtcagg 20040 
tttcagataa aaatgaaata agagttggaa 20100 
ccaacctgtg gagaaatttc ctgtactcca 20160 
agtacagtcc ttccaacgta aaaatttctg 20220 
agcgagtggt ggctcccggg ttagtggact 20280 
ttgactatat ggacaacgtc aacccattta 20340 
gctcaatgtt gctgggcaat ggtcgctatg 204 00 
tctttgccat taaaaacctc cttctcctgc 204 60 
ggaaggatgt taacatggtt ctgcagagct 20520 
ccagcattaa gtttgatagc atttgccttt 20580 
ccgcctccac gcttgaggcc atgcttagaa 20640 
atctctccgc cgccaacatg ctctacccta 20700 
ccatcccctc ccgcaactgg gcggctttcc 207 60 
aggaaacccc atcactgggc tcgggctacg 20820 
cctacctaga tggaaccttt tacctcaacc 20880 
actcttctgt cagctggcct ggcaatgacc 20940 
agcgctcagt tgacggggag ggttacaacg 21000 
tcctggtaca aatgctagct aactacaaca 21060 
gctacaagga ccgcatgtac tccttcttta 21120 
tggatgatac taaatacaag gactaccaac 21180 
ctggatttgt tggctacctt gcccccacca 21240 
tcccctatcc gcttataggc aagaccgcag 21300 
gcgatcgcac cctttggcgc atcccattct 21360 
cagacctggg ccaaaacctt ctctacgcca 21420 
aggtggatcc catggacgag cccacccttc 21480 
tccgtgtgca ccggccgcac cgcggcgtca 21540 
cggccggcaa cgccacaaca taaagaagca 21600 
ctccagtgag caggaactga aagccattgt 21660 
gggcacctat gacaagcgct ttccaggctt 21720 
agtcaatacg gccggtcgcg agactggggg 217 80 
gcactcaaaa acatgctacc tctttgagcc 21840 
ggtttaccag tttgagtacg agtcactcct 21900 
ccgctgtata acgctggaaa agtccaccca 21960 
tggactattc tgctgcatgt ttctccacgc 22020 
tcacaacccc accatgaacc ttattaccgg 22080 
ggtacagccc accctgcgtc gcaaccagga 22140 
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acagctctac agcttcctgg agcgccactc gccctacttc cgcagccaca gtgcgcagat 22200 
taggagcgcc acttcttttt gtcacttgaa aaacatgtaa aaataatgta ctagagacac 22260 
tttcaataaa ggcaaatgct tttatttgta cactctcggg tgattattta cccccaccct 22320 
tgccgtctgc gccgtttaaa aatcaaaggg gttctgccgc gcatcgctat gcgccactgg 22380 
cagggacacg ttgcgatact ggtgtttagt gctccactta aactcaggca caaccatccg 22440 
cggcagctcg gtgaagtttt cactccacag gctgcgcacc atcaccaacg cgtttagcag 22500 
gtcgggcgcc gatatcttga agtcgcagtt ggggcctccg ccctgcgcgc gcgagttgcg 22560 
atacacaggg ttgcagcact ggaacactat cagcgccggg tggtgcacgc tggccagcac 22620 
gctcttgtcg gagatcagat ccgcgtccag gtcctccgcg ttgctcaggg cgaacggagt 22680 
caactttggt agctgccttc ccaaaaaggg cgcgtgccca ggctttgagt tgcactcgca 22740 
ccgtagtggc atcaaaaggt gaccgtgccc ggtctgggcg ttaggataca gcgcctgcat 22800 
aaaagccttg atctgcttaa aagccacctg agcctttgcg ccttcagaga agaacatgcc 22860 
gcaagacttg ccggaaaact gattggccgg acaggccgcg tcgtgcacgc agcaccttgc 22920 
gtcggtgttg gagatctgca ccacatttcg gccccaccgg ttcttcacga tcttggcctt 22980 
gctagactgc tccttcagcg cgcgctgccc gttttcgctc gtcacatcca tttcaatcac 23040 
gtgctcctta tttatcataa tgcttccgtg tagacactta agctcgcctt cgatctcagc 23100 
gcagcggtgc agccacaacg cgcagcccgt gggctcgtga tgcttgtagg tcacctctgc 23160 
aaacgactgc aggtacgcct gcaggaatcg ccccatcatc gtcacaaagg tcttgttgct 23220 
ggtgaaggtc agctgcaacc cgcggtgctc ctcgttcagc caggtcttgc atacggccgc 23280 
cagagcttcc acttggtcag gcagtagttt gaagttcgcc tttagatcgt tatccacgtg 23340 
gtacttgtcc atcagcgcgc gcgcagcctc catgcccttc tcccacgcag acacgatcgg 23400 
cacactcagc gggttcatca ccgtaatttc actttccgct tcgctgggct cttcctcttc 23460 
ctcttgcgtc cgcataccac gcgccactgg gtcgtcttca ttcagccgcc gcactgtgcg 23520 
cttacctcct ttgccatgct tgattagcac cggtgggttg ctgaaaccca ccatttgtag 23580 
cgccacatct tctctttctt cctcgctgtc cacgattacc tctggtgatg gcgggcgctc 23640 
gggcttggga gaagggcgct tctttttctt cttgggcgca atggccaaat ccgccgccga 23700 
ggtcgatggc cgcgggctgg gtgtgcgcgg caccagcgcg tcttgtgatg agtcttcctc 23760 
gtcctcggac tcgatacgcc gcctcatccg cttttttggg ggcgcccggg gaggcggcgg 23820 
cgacggggac ggggacgaca cgtcctccat ggttggggga cgtcgcgccg caccgcgtcc 23880 
gcgctcgggg gtggtttcgc gctgctcctc ttcccgactg gccatttcct tctcctatag 23940 
gcagaaaaag atcatggagt cagtcgagaa gaaggacagc ctaaccgccc cctctgagtt 24000 
cgccaccacc gcctccaccg atgccgccaa cgcgcctacc accttccccg tcgaggcacc 24060 
cccgcttgag gaggaggaag tgattatcga gcaggaccca ggttttgtaa gcgaagacga 24120 
cgaggaccgc tcagtaccaa cagaggataa aaagcaagac caggacaacg cagaggcaaa 24180 
cgaggaacaa gtcgggcggg gggacgaaag gcatggcgac tacctagatg tgggagacga 24240 
cgtgctgttg aagcatctgc agcgccagtg cgccattatc tgcgacgcgt tgcaagagcg 24300 
cagcgatgtg cccctcgcca tagcggatgt cagccttgcc tacgaacgcc acctattctc 24360 
accgcgcgta ccccccaaac gccaagaaaa cggcacatgc gagcccaacc cgcgcctcaa 24420 
cttctacccc gtatttgccg tgccagaggt gcttgccacc tatcacatct ttttccaaaa 24480 
ctgcaagata cccctatcct gccgtgccaa ccgcagccga gcggacaagc agctggcctt 24540 
gcggcagggc gctgtcatac ctgatatcgc ctcgctcaac gaagtgccaa aaatctttga 24 600 
gggtcttgga cgcgacgaga agcgcgcggc aaacgctctg caacaggaaa acagcgaaaa 24660 
tgaaagtcac tctggagtgt tggtggaact cgagggtgac aacgcgcgcc tagccgtact 24720 
aaaacgcagc atcgaggtca cccactttgc ctacccggca cttaacctac cccccaaggt 24780 
catgagcaca gtcatgagtg agctgatcgt gcgccgtgcg cagcccctgg agagggatgc 24840 
aaatttgcaa gaacaaacag aggagggcct acccgcagtt ggcgacgagc agctagcgcg 24 900 
ctggcttcaa acgcgcgagc ctgccgactt ggaggagcga cgcaaactaa tgatggccgc 24 960 
agtgctcgtt accgtggagc ttgagtgcat gcagcggttc tttgctgacc cggagatgca 25020 
gcgcaagcta gaggaaacat tgcactacac ctttcgacag ggctacgtac gccaggcctg 25080 
caagatctcc aacgtggagc tctgcaacct ggtctcctac cttggaattt tgcacgaaaa 25140 
ccgccttggg caaaacgtgc ttcattccac gctcaagggc gaggcgcgcc gcgactacgt 25200 
ccgcgactgc gtttacttat ttctatgcta cacctggcag acggccatgg gcgtttggca 25260 
gcagtgcttg gaggagtgca acctcaagga gctgcagaaa ctgctaaagc aaaacttgaa 25320 
ggacctatgg acggccttca acgagcgctc cgtggccgcg cacctggcgg acatcatttt 25380 
ccccgaacgc ctgcttaaaa ccctgcaaca gggtctgcca gacttcacca gtcaaagcat 25440 
gttgcagaac tttaggaact ttatcctaga gcgctcagga atcttgcccg ccacctgctg 25500 
tgcacttcct agcgactttg tgcccattaa gtaccgcgaa tgccctccgc cgctttgggg 25560 
ccactgctac cttctgcagc tagccaacta ccttgcctac cactctgaca taatggaaga 25620 
cgtgagcggt gacggtctac tggagtgtca ctgtcgctgc aacctatgca ccccgcaccg 25680 
ctccctggtt tgcaattcgc agctgcttaa cgaaagtcaa attatcggta cctttgagct 25740 
gcagggtccc tcgcctgacg aaaagtccgc ggctccgggg ttgaaaccca ctccggggct 25800 
gtggacgtcg gcttaccttc gcaaatttgt acctgaggac taccacgccc acgagattag 25860 
gttctacgaa gaccaatccc gcccgccaaa tgcggagctt accgcctgcg tcattaccca 25920 
gggccacatt cttggccaat tgcaagccat caacaaagcc cgccaagagt ttctgctacg 25980 
aaagggacgg ggggtttact tggaccccca gtccggcgag gagctcaacc caatcccccc 2604 0 
gccgccgcag ccctatcagc agcagccgcg ggcccttgct tcccaggatg gcacccaaaa 26100 
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agaagctgca gctgccgccg ccacccacgg 
aggaggtttt ggacgaggag gaggaggaca 
aagcttccga ggtcgaagag gtgtcagacg 
cgccggcgcc ccagaaatcg gcaaccggtt 
cgccgccggc actgcccgtt cgccgaccca 
ccggtaagtc caagcagccg ccgccgttag 
gctcatggcg cgggcacaag aacgccatag 
tctccttcgc ccgccgcttt cttctctacc 
tgcattacta ccgtcatctc tacagcccat 
acagcagcgg ccacacagaa gcaaaggcga 
aaatccacag cggcggcagc agcaggagga 
gtatcgaccc gcgagcttag aaacaggatt 
agcaggggcc aagaacaaga gctgaaaata 
agctgcctgt atcacaaaag cgaagatcag 
ctcttcagta aatactgcgc gctgactctt 
taagcgcgaa aactacgtca tctccagcgg 
gccattatga gcaaggaaat tcccacgccc 
cttgcggctg gagctgccca agactactca 
cacatgatat cccgggtcaa cggaatccgc 
gcggctatta ccaccacacc tcgtaataac 
gtgtaccagg aaagtcccgc tcccaccact 
gttcagatga ctaactcagg ggcgcagctt 
cccgggcagg gtataactca cctgacaatc 
tcggtgagct cctcgcttgg tctccgtccg 
cgtccttcat tcacgcctcg tcaggcaatc 
cgctctggag gcattggaac tctgcaattt 
aaccccttct cgggacctcc cggccactat 
gtaaaggact cggcggacgg ctacgactga 
ctgaaacacc tggtccactg tcgccgccac 
tgctactttg aattgcccga ggatcatatc 
tataataaat acagaaatta aaatatactg 
cgtcttcacc cgcccaagca aaccaaggcg 
ctctgtgatt tacaacagtt tcaacccaga 
gctcagctac tccatcagaa aaaacaccac 
ttaaaagtca ggcttcctgg atgtcagcat 
ttgttccagt ccaactacag cgacccaccc 
ccgccgctac cggacttaca tctaccacaa 
actgggataa cttgggcatg tggtggttct 
ttatgtggct catctgctgc ctaaagcgca 
tcattgtgct acacccaaac aatgatggaa 
tcttttctct tacagtatga ttaaatgaga 
agcagcacct ccttgccctc ctcccagctc 
tttctccaca atctaaatgg aatgtcagtt 
atcttcatgt tgttgcagat gaagcgcgca 
tatccatatg acacggaaac cggtcctcca 
tcccccaatg ggtttcaaga gagtccccct 
ctagttacct ccaatggcat gcttgcgctc 
gccggcaacc ttacctccca aaatgtaacc 
tcaaacataa acctggaaat atctgcaccc 
gctgccgccg cacctctaat ggtcgcgggc 
ctaaccgtgc acgactccaa acttagcatt 
ggaaagctag ccctgcaaac atcaggcccc 
atcactgcct caccccctct aactactgcc 
cccatttata cacaaaatgg aaaactagga 
gacgacctaa acactttgac cgtagcaact 
ttgcaaacta aagttactgg agccttgggt 
gtagcaggag gactaaggat tgattctcaa 
ccgtttgatg ctcaaaacca actaaatcta 
tcagcccaca acttggatat taactacaac 
aattccaaaa agcttgaggt taacctaagc 
gccatagcca ttaatgcagg agatgggctt 
aatcccctca aaacaaaaat tggccatggc 
cctaaactag gaactggcct tagttttgac 
aataatgata agctaacttt gtggaccaca 
gcagagaaag atgctaaact cactttggtc 
acagtttcag ttttggctgt taaaggcagt 



acgaggagga 


atactgggac 


agtcaggcag 


26160 


tgatggaaga 


ctgggagagc 


ctagacgagg 




aaacaccgtc 


accctcggtc 


gcattcccct 


26280 


ccagcatggc 


tacaacctcc 


gctcctcagg 


26340 


accgtagatg 


ggacaccact 


ggaaccaggg 


26400 


cccaagagca 


acaacagcgc 


caaggctacc 


26460 


ttgcttgctt 


gcaagactgt 


gggggcaaca 


26520 


atcacggcgt 


ggccttcccc 


cgtaacatcc 


26580 


actgcaccgg 


cggcagcggc 


agcggcagca 


26640 


ccggatagca 


agactctgac 


aaagcccaag 


26700 


ggagcgctgc 


gtctggcgcc 


caacgaaccc 


2 67 60 


tttcccactc 


tgtatgctat 


atttcaacag 


26820 


aaaaacaggt 


ctctgcgatc 


cctcacccgc 




cttcggcgca 


cgctggaaga 


cgcggaggct 


26940 


aaggactagt 


ttcgcgccct 




27000 


ccacacccgg 


cgccagcacc 


tgtcgtcagc 


27060 


tacatgtgga 


gttaccagcc 


acaaatggga 


27120 


acccgaataa 


actacatgag 


cgcgggaccc 


27180 


gcccaccgaa 




cttggaacag 


27240 


cttaatcccc 


gtagttggcc 


cgctgccctg 


27300 


gtggtacttc 


ccagagacgc 


ccaggccgaa 


27360 


gcgggcggct 


ttcgtcacag ggtgcggtcg 27420 


agagggcgag 


gtattcagct 


caacgacgag 


27480 


gacgggacat 


ttcagatcgg 


cggcgccggc 


27540 


ctaactctgc 


agacctcgtc 


ctctgagccg 


27600 


attgaggagt 


ttgtgccatc 


ggtctacttt 


27660 


ccggatcaat 




ctttgacgcg 


27720 


taattaagtg 


gagaggcaga gcaactgcgc 


27780 


aagtgctttg 


cccgcgactc 


cggtgagttt 


27840 


gaggatcttt 


gttgccatct 


ctgtgctgag 


27900 


gggctcctat 


cgccatcctg 


taaacgccac 


27960 


aaccttacct 


ggtactttta 


acatctctcc 


28020 


cggagtgagt 


ctacgagaga 


acctctccga 


28080 


cctccttacc 


tgccgggaac 


gtacccttaa 


28140 


ctgactttgg 


ccagcacctg 


tcccgcggat 


28200 


taacagagat 


gaccaacaca 


accaacgcgg 


28260 


atacacccca 


agtttctgcc 


tttgtcaata 


28320 


ccatagcgct 


tatgtttgta 




28380 


aacgcgcccg 


accacccatc 


tatagtccca 


28440 


tccatagatt 


ggacggactg 


aaacacatgt 


28500 


ttaattaagg 


aatttctgtc 


cagtttattc 


28560 


tggtattgca 




ggctgcaaac 


28620 


tcctcctgtt 


cctgtccatc 


cgcacccact 


28680 


agaccgtctg 


aagatacctt 


caaccccgtg 


28740 


actgtgcctt 




tccctttgta 


28800 


ggggtactct 


ctttgcgcct 


atccgaacct 


28860 


aaaatgggca 


acggcctctc 


tctggacgag 


28920 


actgtgagcc 




aaaaaccaag 


28980 


ctcacagtta 


cctcagaagc 


cctaactgtg 


29040 


aacacactca 


ccatgcaatc 


acaggccccg 


29100 


gccacccaag 




agtgtcagaa 


29160 


ctcaccacca 


ccgatagcag 




29220 


actggtagct 


tgggcattga 


cttgaaagag 


29280 


ctaaagtacg 


gggctccttt 


gcatgtaaca 


29340 


ggtccaggtg 


tgactattaa 


taatacttcc 


29400 


tttgattcac 


aaggcaatat 


gcaacttaat 


29460 


aacagacgcc 


ttatacttga 


tgttagttat 


29520 


agactaggac 


agggccctct 


ttttataaac 


29580 


aaaggccttt 


acttgtttac 


agcttcaaac 


29640 


actgccaagg 


ggttgatgtt 


tgacgctaca 


29700 


gaatttggtt 


cacctaatgc 




29760 


ctagaatttg 


attcaaacaa 


ggctatggtt 


29820 


agcacaggtg 


ccattacagt 


aggaaacaaa 


29880 






tagactaaat 


29940 


ttaacaaaat 


gtggcagtca 


aatacttgct 


30000 


ttggctccaa 


tatctggaac 


agttcaaagt 


30060 
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gctcatctta ttataagatt tgacgaaaat 
ccagaatatt ggaactttag aaatggagat 
qttggattta tgcctaacct atcagcttat 
aacattgtca gtcaagttta cttaaacgga 
acactaaacg gtacacagga aacaggagac 
tcatgggact ggtctggcca caactacatt 
ttttcataca ttgcccaaga ataaagaatc 
tcaattgcag aaaatttcaa gtcatttttc 
cttatacaga tcaccgtacc ttaatcaaac 
tccctcccaa cacacagagt a'cacagtcct 
atcatgggta acagacatat tcttaggtgt 
acgctcatca gtgatattaa taaactcccc 
cagctgctga gccacaggct gctgtccaac 
agtccacgcc tacatggggg tagagtcata 
cagcagcgcg cgaataaact gctgccgccg 
agtggtctcc tcagcgatga ttcgcaccgc 
acagcagcgc accctgatct cacttaaatc 
attgttcaaa atcccacagt gcaaggcgct 
acccacgtgg ccatcatacc acaagcgcag 
gctggacata aacattacct cttttggcat 
aaacctctga ttaaacatgg cgccatccac 
cccgccggct atacactgca gggaaccggg 
ctcgtaacca tggatcatca tgctcgtcat 
gtgcatacac ttcctcagga ttacaagctc 
aacccattcc tgaatcagcg taaatcccac 
gttgtgcatt gtcaaagtgt tacattcggg 
gcgggtttct gtctcaaaag gaggtagacg 
ccgagatcgt gttggtcgta gtgtcatgcc 
tgaagcaaaa ccaggtgcgg gcgtgacaaa 
atcgctctgt gtagtagttg tagtatatcc 
cttcgggttc tatgtaaact ccttcatgcg 
aataagccac acccagccaa cctacacatt 
gaagagctgg aagaaccatg tttttttttt 
tgaagatcta ttaagtgaac gcgctcccct 
gaacagataa tggcatttgt aagatgttgc 
acgtccaagt ggacgtaaag gctaaaccct 
gcaccttcaa ccatgcccaa ataattctca 
aaatcccgaa tattaagtcc ggccattgta 
agcctcaagc agcgaatcat gattgcaaaa 
tcaaaagcgg aacattaaca aaaataccgc 
gaacataatc gtgcaggtct gcacggacca 
caaaagaacc cacactgatt atgacacgca 
cgatgtaagc tttgttgcat gggcggcgat 
ggcaaagcct cgcgcaaaaa agaaagcaca 
gtaagctccg gaaccaccac agaaaaagac 
ttctgcataa acacaaaata aaataacaaa 
caacaggaaa aacaaccctt ataagcataa 
aaaaaaactg gtcaccgtga ttaaaaagca 
tcataatgta agactcggta aacacatcag 
ccgaaatagc ccgggggaat acatacccgc 
ggaggtataa caaaattaat aggagagaaa 
ctaggcaaaa tagcaccctc ccgctccaga 
aacagtcagc cttaccagta aaaaagaaaa 
ccagctcaat cagtcacagt gtaaaaaagg 
aaaaaatgac gtaacggtta aagtccacaa 
cgcccagaaa cgaaagccaa aaaacccaca 
acgttacgta acttcccatt ttaagaaaac 
ccctaaaacc tacgtcaccc gccccgttcc 
ccctcattat catattggct tcaatccaaa 

<210> 2 
<211> 34341 
<212> DNA 

<213> Adenovirus subgroup C 
<400> 2 



ggagtgctac taaacaattc cttcctggac 30120 
cttactgaag gcacagccta tacaaacgct 30180 
ccaaaatctc acggtaaaac tgccaaaagt 30240 
gacaaaacta aacctgtaac actaaccatt 30300 
acaactccaa gtgcatactc tatgtcattt 30360 
aatgaaatat ttgccacatc ctcttacact 30420 
gtttgtgtta tgtttcaacg tgtttatttt 30480 
attcagtagt atagccccac caccacatag 30540 
tcacagaacc ctagtattca acctgccacc 30600 
ttctccccgg ctggccttaa aaagcatcat 30660 
tatattccac acggtttcct gtcgagccaa 30720 
gggcagctca cttaagttca tgtcgctgtc 30780 
ttgcggttgc ttaacgggcg gcgaaggaga 3084 0 
atcgtgcatc aggatagggc ggtggtgctg 30900 
ccgctccgtc ctgcaggaat acaacatggc 30960 
ccgcagcata aggcgccttg tcctccgggc 31020 
agcacagtaa ctgcagcaca gcaccacaat 31080 
gtatccaaag ctcatggcgg ggaccacaga 31140 
gtagattaag tggcgacccc tcataaacac 31200 
gttgtaattc accacctccc ggtaccatat 31260 
caccatccta aaccagctgg ccaaaacctg 31320 
actggaacaa tgacagtgga gagcccagga 313B0 
gatatcaatg ttggcacaac acaggcacac 31440 
ctcccgcgtt agaaccatat cccagggaac 31500 
actgcaggga agacctcgca cgtaactcac 31560 
cagcagcgga tgatcctcca gtatggtagc 31620 
atccctactg tacggagtgc gccgagacaa 31680 
aaatggaacg ccggacgtag tcatatttcc 31740 
cagatctgcg tctccggtct cgccgcttag 31800 
actctctcaa agcatccagg cgccccctgg 31860 
ccgctgccct gataacatcc accaccgcag 31920 
cgttctgcga gtcacacacg ggaggagcgg 31980 
tattccaaaa gattatccaa aacctcaaaa 32040 
ccggtggcgt ggtcaaactc tacagccaaa 32100 
acaatggctt ccaaaaggca aacggccctc 32160 
tcagggtgaa tctcctctat aaacattcca 32220 
tctcgccacc ttctcaatat atctctaagc 32280 
aaaatctgct ccagagcgcc ctccaccttc 32340 
attcaggttc ctcacagacc tgtataagat 32400 
gatcccgtag gtcccttcgc agggccagct 32460 
gcgcggccac ttccccgcca ggaaccttga 32520 
tactcggagc tatgctaacc agcgtagccc 32580 
ataaaatgca aggtgctgct caaaaaatca 32640 
tcgtagtcat gctcatgcag ataaaggcag 32700 
accatttttc tctcaaacat gtctgcgggt 32760 
aaaacattta aacattagaa gcctgtctta 32820 
gacggactac ggccatgccg gcgtgaccgt 32880 
ccaccgacag ctcctcggtc atgtccggag 32940 
gttgattcat cggtcagtgc taaaaagcga 33000 
aggcgtagag acaacattac agcccccata 33060 
aacacataaa cacctgaaaa accctcctgc 33120 
acaacataca gcgcttcaca gcggcagcct 33180 
cctattaaaa aaacaccact cgacacggca 33240 
gccaagtgca gagcgagtat atataggact 33300 
aaaacaccca gaaaaccgca cgcgaaccta 33360 
acttcctcaa atcgtcactt ccgttttccc 33420 
tacaattccc aacacataca agttactccg 334 80 
cacgccccgc gccacgtcac aaactccacc 33540 
ataaggtata ttattgatga tg 33592 
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catcatcaat aatatacctt attttggatt 
ttgtgacgtg gcgcggggcg tgggaacggg 
gatgttgcaa gtgtggcgga acacatgtaa 
gtgtgcgccg gtgtacacag gaagtgacaa 
taaatttggg cgtaaccgag taagatttgg 
agtgaaatct gaataatttt gtgttactca 
gactttgacc gtttacgtgg agactcgccc 
cgggtcaaag ttggcgtttt attattatag 
tgagttcctc aagaggccac tcttgagtgc 
tccgacaccg ggactgaaaa tgagacatga 
ccattttgaa ccacctaccc ttcacgaact 
tcccaacgag gaggcggttt cgcagatttt 
agggattgac ttactcactt ttccgccggc 
ccggcagccc gagcagccgg agcagagagc 
tccacccagt gacgacgagg atgaagaggg 
ccccgggcac ggttgcaggt cttgtcatta 
tatgtgttcg ctttgctata tgaggacctg 
atgggcagtg ggtgatagag tggtgggttt 
gttttgtggt ttaaagaatt ttgtattgtg 
gagcctgagc ccgagccaga accggagcct 
cctgctatcc tgagacgccc gacatcacct 
agctgtgact ccggtccttc taacacacct 
cccattaaac cagttgccgt gagagttggt 
gacttgctta acgagcctgg gcaacctttg 
ggtgtaaacc tgtgattgcg tgtgtggtta 
agtttaataa agggtgagat aatgtttaac 
aaagggtata taatgcgccg tgggctaatc 
gagtgtttgg aagatttttc tgctgtgcgt 
tcttggtttt ggaggtttct gtggggctca 
gaggattaca agtgggaatt tgaagagctt 
ttgaatctgg gtcaccaggc gcttttccaa 
acaccggggc gcgctgcggc tgctgttgct 
gaagaaaccc atctgagcgg ggggtacctg 
gcggttgtga gacacaagaa tcgcctgcta 
ccgacggagg agcagcagca gcagcaggag 
ccatggaacc cgagagccgg cctggaccct 
tgtatccaga actgagacgc attttgacaa 
taaagaggga gcggggggct tgtgaggcta 
taatgaccag acaccgtcct gagtgtatta 
atgagcttga tctgctggcg cagaagtatt 
agccagggga tgattttgag gaggctatta 
attgcaagta caagatcagc aaacttgtaa 
acggggccga ggtggagata gatacggagg 
atatgtggcc gggggtgctt ggcatggacg 
gccccaattt tagcggtacg gttttcctgg 
gcttctatgg gtttaacaat acctgtgtgg 
gtgcctttta ctgctgctgg aagggggtgg 
agaaatgcct ctttgaaagg tgtaccttgg 
gccacaatgt ggcctccgac tgtggttgct 
agcataacat ggtatgtggc aactgcgagg 
acggcaactg tcacctgctg aagaccattc 
cagtgtttga gcataacata ctgacccgct 
tgttcctacc ttaccaatgc aatttgagtc 
tgtccaaggt gaacctgaac ggggtgtttg 
ggtacgatga gacccgcacc aggtgcagac 
accagcctgt gatgctggat gtgaccgagg 
gcacccgcgc tgagtttggc tctagcgatg 
ggcgtggctt aagggtggga aagaatatat 
gttttgcagc agccgccgcc gccatgagca 
catatttgac aacgcgcatg cccccatggg 
gcattgatgg tcgccccgtc ctgcccgcaa 
ctggaacgcc gttggagact gcagcctccg 
gcgggattgt gactgacttt gctttcctga 
catccgcccg cgatgacaag ttgacggctc 
aacttaatgt cgtttctcag cagctgttgg 
cttcctcccc tcccaatgcg gtttaaaaca 



gaagccaata tgataatgag ggggtggagt 60 

gcgggtgacg tagtagtgtg gcggaagtgt 120 

gcgacggatg tggcaaaagt gacgtttttg 180 

ttttcgcgcg gttttaggcg gatgttgtag 240 

ccattttcgc gggaaaactg aataagagga 300 

tagcgcgtaa tatttgtcta gggccgcggg 360 

aggtgttttt ctcaggtgtt ttccgcgttc 420 

tcagctgacg tgtagtgtat ttatacccgg 480 

cagcgagtag agttttctcc tccgagccgc 54 0 

ggtactggct gataatcttc cacctcctag 600 

gtatgattta gacgtgacgg cccccgaaga 660 

tcccgactct gtaatgttgg cggtgcagga 720 

gcccggttct ccggagccgc ctcacctttc 780 

cttgggtccg gtttgccacg aggctggctt 84 0 

tgaggagttt gtgttagatt atgtggagca 900 

tcaccggagg aatacggggg acccagatat 960 

tggcatgttt gtctacagta agtgaaaatt 1020 

ggtgtggtaa tttttttttt aatttttaca 1080 

atttttttaa aaggtcctgt gtctgaacct 1140 

gcaagaccta cccgccgtcc taaaatggcg 1200 

gtgtctagag aatgcaatag tagtacggat 1260 

cctgagatac acccggtggt cccgctgtgc 1320 

gggcgtcgcc aggctgtgga atgtatcgag 1380 

gacttgagct gtaaacgccc caggccataa 1440 

acgcctttgt ttgctgaatg agttgatgta 1500 

ttgcatggcg tgttaaatgg ggcggggctt 1560 

ttggttacat ctgacctcat ggaggcttgg 1620 

aacttgctgg aacagagctc taacagtacc 1680 

tcccaggcaa agttagtctg cagaattaag 1740 

ttgaaatcct gtggtgagct gtttgattct 1800 

gagaaggtca tcaagacttt ggatttttcc 18 60 

tttttgagtt ttataaagga taaatggagc 1920 

ctggattttc tggccatgca tctgtggaga 1980 

ctgttgtctt ccgtccgccc ggcgataata 2040 

gaagccaggc ggcggcggca ggagcagagc 2100 

cgggaatgaa tgttgtacag gtggctgaac 2160 

ttacagagga tgggcagggg ctaaaggggg 2220 

cagaggaggc taggaatcta gcttttagct 2280 

cttttcaaca gatcaaggat aattgcgcta 2340 

ccatagagca gctgaccact tactggctgc 2400 

gggtatatgc aaaggtggca cttaggccag 24 60 

atatcaggaa ttgttgctac atttctggga 2520 

atagggtggc ctttagatgt agcatgataa 2580 

gggtggttat tatgaatgta aggtttactg 2640 

ccaataccaa ccttatccta cacggtgtaa 2700 

aagcctggac cgatgtaagg gttcggggct 2760 

tgtgtcgccc caaaagcagg gcttcaatta 2820 

gtatcctgtc tgagggtaac tccagggtgc 2880 

tcatgctagt gaaaagcgtg gctgtgatta 2940 

acagggcctc tcagatgctg acctgctcgg 3000 

acgtagccag ccactctcgc aaggcctggc 3060 

gttccttgca tttgggtaac aggagggggg 3120 

acactaagat attgcttgag cccgagagca 3180 

acatgaccat gaagatctgg aaggtgctga 3240 

cctgcgagtg tggcggtaaa catattagga 3300 

agctgaggcc cgatcacttg gtgctggcct 3360 

aagatacaga ttgaggtact gaaatgtgtg 3420 

aaggtggggg tcttatgtag ttttgtatct 3480 

ccaactcgtt tgatggaagc attgtgagct 3540 

ccggggtgcg tcagaatgtg atgggctcca 3600 

actctactac cttgacctac gagaccgtgt 3660 

ccgccgcttc agccgctgca gccaccgccc 3720 

gcccgcttgc aagcagtgca gcttcccgtt 3780 

ttttggcaca attggattct ttgacccggg 384 0 

atctgcgcca gcaggtttct gccctgaagg 3900 

taaataaaaa accagactct gtttggattt 3960 
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ggatcaagca agtgtcttgc tgtctttatt 
accagcggtc tcggtcgttg agggtcctgt 
tctggatgtt cagatacatg ggcataagcc 
gagcttcatg ctgcggggtg gtgttgtaga 
ggtgcctaaa aatgtctttc agtagcaagc 
tgtttacaaa gcggttaagc tgggatgggt 
actgtatttt taggttggct atgttcccag 
gaaccaccag cacagtgtat ccggtgcact 
atgcgtggaa gaacttggag acgcccttgt 
taatgatggc aatgggccca cgggcggcgg 
cgtcatagtt gtgttccagg atgagatcgt 
gggtgccaga ctgcggtata atggttccat 
tttgcatttc ccacgctttg agttcagatg 
agaaaacggt ttccggggta ggggagatca 
gcgacttacc gcagccggtg ggcccgtaaa 
taagagagct gcagctgccg tcatccctga 
tgactcgcat gttttccctg accaaatccg 
gttcttgcaa ggaagcaaag tttttcaacg 
tgagcgtttg accaagcagt tccaggcggt 
ctcgatccag catatctcct cgtttcgcgg 
tcggtgctcg tccagacggg ccagggtcat 
cgtagtctgg gtcacggtga aggggtgcgc 
gaggctggtc ctgctggtgc tgaagcgctg 
gcatttgacc atggtgtcat agtccagccc 
gcccttggag gaggcgccgc acgaggggca 
cgcgagaaat accgattccg gggagtaggc 
gcattccacg agccaggtga gctctggccg 
ctttttgatg cgtttcttac ctctggtttc 
aaggctgtcc gtgtccccgt atacagactt 
gtcctcctcg tatagaaact cggaccactc 
gaaggaggct aagtgggagg ggtagcggtc 
ggtgtgaaga cacatgtcgc cctcttcggc 
ggccacgtga ccgggtgttc ctgaaggggg 
ctcactctct tccgcatcgc tgtctgcgag 
aaaagcgggc atgacttctg cgctaagatt 
attcacctgg cccgcggtga tgcctttgag 
aatctttttg ttgtcaagct tggtggcaaa 
ggcgatggag cgcagggttt ggtttttgtc 
tagctgcacg tattcgcgcg caacgcaccg 
gggcaccagg tgcacgcgcc aaccgcggtt 
tacctctccg cgtaggcgct cgttggtcca 
tggcggtagg gggtctagct gcgtctcgtc 
gggcagcagg cgcgcgtcga agtagtctat 
ccatgcgcgg gcggcaagcg cgcgctcgta 
gtgggtgagc gcggaggcgt acatgccgca 
tattccaaga tatgtagggt agcatcttcc 
tagttcgtgc gagggagcga ggaggtcggg 
tcggaagact atctgcctga agatggcatg 
gacgttgaag ctggcgtctg tgagacctac 
gcgcagcttg ttgaccagct cggcggtgac 
ttccttgatg atgtcatact tatcctgtcc 
aaactcttcg cggtctttcc agtactcttg 
agagcctagc atgtagaact ggttgacggc 
tagcgcgtat gcctgcgcgg ccttccggag 
gaccatgact ttgaggtact ggtatttgaa 
gagcaaaaag tccgtgcgct ttttggaacg 
gaagagtatc tttcccgcgc gaggcataaa 
ctcggaacgg ttgttaatta cctgggcggc 
gtggcccaca atgtaaagtt ccaagaagcg 
aagttcctcg taggtgagct cttcagggga 
tgcaagatga gggttggaag cgacgaatga 
caggtggtcg cgaaaggtcc taaactggcg 
gtagaaggta agcgggtctt gttcccagcg 
cgcggcagtc actagaggct catctccgcc 
ctgcttccca aaggccccca tccaagtata 
ctcggtgcga ggatgcgagc cgatcgggaa 



taggggtttt gcgcgcgcgg taggcccggg 4020 

gtattttttc caggacgtgg taaaggtgac 4080 

cgtctctggg gtggaggtag caccactgca 4140 

tgatccagtc gtagcaggag cgctgggcgt 4200 

tgattgccag gggcaggccc ttggtgtaag 42 60 

gcatacgtgg ggatatgaga tgcatcttgg 4320 

ccatatccct ccggggattc atgttgtgca 4380 

tgggaaattt gtcatgtagc ttagaaggaa 44 4 0 

gacctccaag attttccatg cattcgtcca 4500 

cctgggcgaa gatatttctg ggatcactaa 4560 

cataggccat ttttacaaag cgcgggcgga 4620 

ccggcccagg ggcgtagtta ccctcacaga 4 680 

gggggatcat gtctacctgc ggggcgatga 474 0 

gctgggaaga aagcaggttc ctgagcagct 4800 

tcacacctat taccgggtgc aactggtagt 4860 

gcaggggggc cacttcgtta agcatgtccc 4 920 

ccagaaggcg ctcgccgccc agcgatagca 4 98 0 

gtttgagacc gtccgccgta ggcatgcttt 504 0 

cccacagctc ggtcacctgc tctacggcat 5100 

gttggggcgg ctttcgctgt acggcagtag 5160 

gtctttccac gggcgcaggg tcctcgtcag 5220 

tccgggctgc gcgctggcca gggtgcgctt 5280 

ccggtcttcg ccctgcgcgt cggccaggta 534 0 

ctccgcggcg tggcccttgg cgcgcagctt 5400 

gtgcagactt ttgagggcgt agagcttggg 54 60 

atccgcgccg caggccccgc agacggtctc 552 0 

ttcggggtca aaaaccaggt ttcccccatg 5580 

catgagccgg tgtccacgct cggtgacgaa 564 0 

gagaggcctg tcctcgagcg gtgttccgcg 5700 

tgagacaaag gctcgcgtcc aggccagcac 57 60 

gttgtccact agggggtcca ctcgctccag 5820 

atcaaggaag gtgattggtt tgtaggtgta 5880 

gctataaaag ggggtggggg cgcgttcgtc 594 0 

ggccagctgt tggggtgagt actccctctg 6000 

gtcagtttcc aaaaacgagg aggatttgat 6060 

ggtggccgca tccatctggt cagaaaagac 6120 

cgacccgtag agggcgttgg acagcaactt 6180 

gcgatcggcg cgctccttgg ccgcgatgtt 6240 

ccattcggga aagacggtgg tgcgctcgtc 6300 

gtgcagggtg acaaggtcaa cgctggtggc 6360 

gcagaggcgg ccgcccttgc gcgagcagaa 6420 

cggggggtct gcgtccacgg taaagacccc 6480 

cttgcatcct tgcaagtcta gcgcctgctg 654 0 

tgggttgagt gggggacccc atggcatggg 6600 

aatgtcgtaa acgtagaggg gctctctgag 6660 

accgcggatg ctggcgcgca cgtaatcgta 6720 

accgaggttg ctacgggcgg gctgctctgc 6780 

tgagttggat gatatggttg gacgctggaa 684 0 

cgcgtcacgc acgaaggagg cgtaggagtc 6900 

ctgcacgtct agggcgcagt agtccagggt 6960 

cttttttttc cacagctcgc ggttgaggac 7020 

gatcggaaac ccgtcggcct ccgaacggta 7080 

ctggtaggcg cagcatccct tttctacggg 714 0 

cgaggtgtgg gtgagcgcaa aggtgtccct 7200 

gtcagtgtcg tcgcatccgc cctgctccca 7260 

cggatttggc agggcgaagg tgacatcgtt 7320 

gttgcgtgtg atgcggaagg gtcccggcac 7380 

gagcacgatc tcgtcaaagc cgttgatgtt 7440 

cgggatgccc ttgatggaag gcaatttttt 7500 

gctgagcccg tgctctgaaa gggcccagtc 7560 

gctccacagg tcacgggcca ttagcatttg 7620 

acctatggcc attttttctg gggtgatgca 7680 

gtcccatcca aggttcgcgg ctaggtctcg 77 4 0 

gaacttcatg accagcatga agggcacgag 7800 

ggtctctaca tcgtaggtga caaagagacg 7860 

gaactggatc tcccgccacc aattggagga 7920 
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gtggctattg atgtggtgaa agtagaagtc 
tttgtaaaaa cgtgcgcagt actggcagcg 
gacctgacga ccgcgcacaa ggaagcagag 
tggctggtgg tcttctactt cggctgcttg 
tacggtggat cggaccacca cgccgcgcga 
tcggagcttg atgacaacat cgcgcagatg 
cgtcaggtca ggcgggagct cctgcaggtt 
tagatccagg tgatacctaa tttccagggg 
gaggccgcat ccccgcggcg cgactacggt 
gtccttggat gatgcatcta aaagcggtga 
tccggacccg ccgggagagg gggcaggggc 
tgctgcgcgc gtaggttgct ggcgaacgcg 
cgcctctgcg tgaagacgac gggcccggtg 
tcaatttcgg tgtcgttgac ggcggcctgg 
tcttgatagg cgatctcggc catgaactgc 
ccggctcgct ccacggtggc ggcgaggtcg 
gcgttgaggc ctccctcgtt ccagacgcgg 
gcgcgcatga ccacctgcgc gagattgagc 
cgcaggcgct gaaagaggta gttgagggtg 
ataacccagc gtcgcaacgt ggattcgttg 
gcctcgtaga agtccacggc gaagttgaaa 
tcctcctcca gaagacggat gagctcggcg 
acaggggcct cttcttcttc ttcaatctcc 
tctggcggcg gtgggggagg ggggacacgg 
acaaagcgct cgatcatctc cccgcggcga 
ttctcgcggg ggcgcagttg gaagacgccg 
gggctgccat gcggcaggga tacggcgcta 
actccgccgc cgagggacct gagcgag.tcc 
aaggcgtcta accagtcaca gtcgcaaggt 
cggcggtcgg ggttgtttct ggcggaggtg 
ttgagacggc ggatggtcga cagaagcacc 
aggcggtcgg ccatgcccca ggcttcgttt 
tgcatgagcc tttctaccgg cacttcttct 
tctatcgctg cggcggcggc ggagtttggc 
gtgaccccga agcccctcat cggctgaagc 
aatatggcct gctgcacctg cgtgagggta 
tggtatgcgc ccgtgttgat ggtgtaagtg 
tggtgacccg gctgcgagag ctcggtgtac 
acgtagtcgt tgcaagtccg caccaggtac 
tggcggtaga ggggccagcg tagggtggcc 
aggcgatgat atccgtagat gtacctggac 
gcgcgcggaa agtcgcggac gcggttccag 
gtcgggacgc tctggccggt caggcgcgcg 
gagcctgtaa gcgggcactc ttccgtggtc 
ggacgaccgg ggttcgagcc ccgtatccgg 
cgcgtgtcga acccaggtgt gcgacgtcag 
tccaggcgcg gcggctgctg cgctagcttt 
gttaggctgg aaagcgaaag cattaagtgg 
caagggttga gtcgcgggac ccccggttcg 
ggggtttgcc tccccgtcat gcaagacccc 
gccccttttt tgcttttccc agatgcatcc 
gcagcggcaa gagcaagagc agcggcagac 
gtcaggaggg gcgacatccg cggttgacgc 
gcgccgggcc cggcactacc tggacttgga 
gccctctcct gagcggtacc caagggtgca 
gccgcggcag aacctgtttc gcgaccgcga 
aaagttccac gcagggcgcg agctgcggca 
ggaggacttt gagcccgacg cgcgaaccgg 
cgccgacctg gtaaccgcat acgagcagac 
ctttaacaac cacgtgcgta cgcttgtggc 
tctgtgggac tttgtaagcg cgctggagca 
gctgttcctt atagtgcagc acagcaggga 
catagtagag cccgagggcc gctggctgct 
ggtgcaggag cgcagcttga gcctggctga 
tagcctgggc aagttttacg cccgcaagat 
ggaggtaaag atcgaggggt tctacatgcg 



cctgcgacgg gccgaacact cgtgctggct 7980 
gtgcacgggc tgtacatcct gcacgaggtt 8040 
tgggaatttg agcccctcgc ctggcgggtt 8100 
tccttgaccg tctggctgct cgaggggagt 8160 
gcccaaagtc cagatgtccg cgcgcggcgg 8220 
ggagctgtcc atggtctgga gctcccgcgg 8280 
tacctcgcat agacgggtca gggcgcgggc 8340 
ctggttggtg gcggcgtcga tggcttgcaa 8400 
accgcgcggc gggcggtggg ccgcgggggt 84 60 
cgcgggcgag cccccggagg tagggggggc 8520 
acgtcggcgc cgcgcgcggg caggagctgg 8580 
acgacgcggc ggttgatctc ctgaatctgg 8640 
agcttgagcc tgaaagagag ttcgacagaa 8700 
cgcaaaatct cctgcacgtc tcctgagttg 8760 
tcgatctctt cctcctggag atctccgcgt 8820 
ttggaaatgc gggccatgag ctgcgagaag 8880 
ctgtagacca cgcccccttc ggcatcgcgg 8940 
tccacgtgcc gggcgaagac ggcgtagttt 9000 
gtggcggtgt gttctgccac gaagaagtac 9060 
atatccccca aggcctcaag gcgctccatg 9120 
aactgggagt tgcgcgccga cacggttaac 9180 
acagtgtcgc gcacctcgcg ctcaaaggct 9240 
tcttccataa gggcctcccc ttcttcttct 9300 
cggcgacgac ggcgcaccgg gaggcggtcg 9360 
cggcgcatgg tctcggtgac ggcgcggccg 94 20 
cccgtcatgt cccggttatg ggttggcggg 9480 
acgatgcatc tcaacaattg ttgtgtaggt 9540 
gcatcgaccg gatcggaaaa cctctcgaga 9600 
aggctgagca ccgtggcggg cggcagcggg 9660 
ctgctgatga tgtaattaaa gtaggcggtc 9720 
atgtccttgg gtccggcctg ctgaatgcgc 9780 
tgacatcggc gcaggtcttt gtagtagtct 9840 
tctccttcct cttgtcctgc atctcttgca 9900 
cgtaggtggc gccctcttcc tcccatgcgt 9960 
agggctaggt cggcgacaac gcgctcggct 10020 
gactggaagt catccatgtc cacaaagcgg 10080 
cagttggcca taacggacca gttaacggtc 10140 
ctgagacgcg agtaagccct cgagtcaaat 10200 
tggtatccca ccaaaaagtg cggcggcggc 10260 
ggggctccgg gggcgagatc ttccaacata 10320 
atccaggtga tgccggcggc ggtggtggag 10380 
atgttgcgca gcggcaaaaa gtgctccatg 10440 
caatcgttga cgctctagcg tgcaaaagga 10500 
tggtggataa attcgcaagg gtatcatggc 10560 
ccgtccgccg tgatccatgc ggttaccgcc 10620 
acaacggggg agtgctcctt ttggcttcct 10680 
tttggccact ggccgcgcgc agcgtaagcg 10740 
ctcgctccct gtagccggag ggttattttc 10800 
agtctcggac cggccggact gcggcgaacg 10860 
gcttgcaaat tcctccggaa acagggacga 10920 
ggtgctgcgg cagatgcgcc cccctcctca 10980 
atgcagggca ccctcccctc ctcctaccgc 11040 
ggcagcagat ggtgattacg aacccccgcg 11100 
ggagggcgag ggcctggcgc ggctaggagc 11160 
gctgaagcgt gatacgcgtg aggcgtacgt 11220 
gggagaggag cccgaggaga tgcgggatcg 11280 
tggcctgaat cgcgagcggt tgctgcgcga 11340 
gattagtccc gcgcgcgcac acgtggcggc 11400 
ggtgaaccag gagattaact ttcaaaaaag 11460 
gcgcgaggag gtggctatag gactgatgca 11520 
aaacccaaat agcaagccgc tcatggcgca 11580 
caacgaggca ttcagggatg cgctgctaaa 11640 
cgatttgata aacatcctgc agagcatagt 11700 
caaggtggcc gccatcaact attccatgct 11760 
ataccatacc ccttacgttc ccatagacaa 11820 
catggcgctg aaggtgctta ccttgagcga 11880 
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cgacctgggc gtttatcgca acgagcgcat ccacaaggcc gtgagcgtga gccggcggcg 11940 
cgagctcagc gaccgcgagc tgatgcacag cctgcaaagg gccctggctg gcacgggcag 12000 
cggcgataga gaggccgagt cctactttga cgcgggcgct gacctgcgct gggccccaag 12060 
ccgacgcgcc ctggaggcag ctggggccgg acctgggctg gcggtggcac ccgcgcgcgc 12120 
tggcaacgtc ggcggcgtgg aggaatatga cgaggacgat gagtacgagc cagaggacgg 12180 
cgagtactaa gcggtgatgt ttctgatcag atgatgcaag acgcaacgga cccggcggtg 12240 
cgggcggccc tgcagagcca gccgtccggc cttaactcca cggacgactg gcgccaggtc 12300 
atggaccgca tcatgtcgct gactgcgcgc aatcctgacg cgttccggca gcagccgcag 12360 
gccaaccggc tctccgcaat tctggaagcg gtggtcccgg cgcgcgcaaa ccccacgcac 12420 
gagaaggtgc tggcgatcgt aaacgcgctg gccgaaaaca gggccatccg gcccgacgag 12480 
gccggcctgg tctacgacgc gctgcttcag cgcgtggctc gttacaacag cggcaacgtg 12540 
cagaccaacc tggaccggct ggtgggggat gtgcgcgagg ccgtggcgca gcgtgagcgc 12600 
gcgcagcagc agggcaacct gggctccatg gttgcactaa acgccttcct gagtacacag 12660 
cccgccaacg tgccgcgggg acaggaggac tacaccaact ttgtgagcgc actgcggcta 12720 
atggtgactg agacaccgca aagtgaggtg taccagtctg ggccagacta ttttttccag 12780 
accagtagac aaggcctgca gaccgtaaac ctgagccagg ctttcaaaaa cttgcagggg 12840 
ctgtgggggg tgcgggctcc cacaggcgac cgcgcgaccg tgtctagctt gctgacgccc 12900 
aactcgcgcc tgttgctgct gctaatagcg cccttcacgg acagtggcag cgtgtcccgg 12960 
gacacatacc taggtcactt gctgacactg taccgcgagg ccataggtca ggcgcatgtg 13020 
gacgagcata ctttccagga gattacaagt gtcagccgcg cgctggggca ggaggacacg 13080 
ggcagcctgg aggcaaccct aaactacctg ctgaccaacc ggcggcagaa gatcccctcg 13140 
ttgcacagtt taaacagcga ggaggagcgc attttgcgct acgtgcagca gagcgtgagc 13200 
cttaacctga tgcgcgacgg ggtaacgccc agcgtggcgc tggacatgac cgcgcgcaac 13260 
atggaaccgg gcatgtatgc ctcaaaccgg ccgtttatca accgcctaat ggactacttg 13320 
catcgcgcgg ccgccgtgaa ccccgagtat ttcaccaatg ccatcttgaa cccgcactgg 13380 
ctaccgcccc ctggtttcta caccggggga ttcgaggtgc ccgagggtaa cgatggattc 13440 
ctctgggacg acatagacga cagcgtgttt tccccgcaac cgcagaccct gctagagttg 13500 
caacagcgcg agcaggcaga ggcggcgctg cgaaaggaaa gcttccgcag gccaagcagc 13560 
ttgtccgatc taggcgctgc ggccccgcgg tcagatgcta gtagcccatt tccaagcttg 13620 
atagggtctc ttaccagcac tcgcaccacc cgcccgcgcc tgctgggcga ggaggagtac 13680 
ctaaacaact cgctgctgca gccgcagcgc gaaaaaaacc tgcctccggc atttcccaac 13740 
aacgggatag agagcctagt ggacaagatg agtagatgga agacgtacgc gcaggagcac 13800 
agggacgtgc caggcccgcg cccgcccacc cgtcgtcaaa ggcacgaccg tcagcggggt 13860 
ctggtgtggg aggacgatga ctcggcagac gacagcagcg tcctggattt gggagggagt 13920 
ggcaacccgt ttgcgcacct tcgccccagg ctggggagaa tgttttaaaa aaaaaaaagc 13980 
atgatgcaaa ataaaaaact caccaaggcc atggcaccga gcgttggttt tcttgtattc 14040 
cccttagtat gcggcgcgcg gcgatgtatg aggaaggtcc tcctccctcc tacgagagtg 14100 
tggtgagcgc ggcgccagtg gcggcggcgc tgggttctcc cttcgatgct cccctggacc 14160 
cgccgtttgt gcctccgcgg tacctgcggc ctaccggggg gagaaacagc atccgttact 14220 
ctgagttggc acccctattc gacaccaccc gtgtgtacct ggtggacaac aagtcaacgg 14280 
atgtggcatc cctgaactac cagaacgacc acagcaactt tctgaccacg gtcattcaaa 14340 
acaatgacta cagcccgggg gaggcaagca cacagaccat caatcttgac gaccggtcgc 14400 
actggggcgg cgacctgaaa accatcctgc ataccaacat gccaaatgtg aacgagttca 14460 
tgtttaccaa taagtttaag gcgcgggtga tggtgtcgcg cttgcctact aaggacaatc 14520 
aggtggagct gaaata'cgag tgggtggagt tcacgctgcc cgagggcaac tactccgaga 14580 
ccatgaccat agaccttatg aacaacgcga tcgtggagca ctacttgaaa gtgggcagac 14640 
agaacggggt tctggaaagc gacatcgggg taaagtttga cacccgcaac ttcagactgg 14700 
ggtttgaccc cgtcactggt cttgtcatgc ctggggtata tacaaacgaa gccttccatc 14760 
cagacatcat tttgctgcca ggatgcgggg tggacttcac ccacagccgc ctgagcaact 14820 
tgttgggcat ccgcaagcgg caacccttcc aggagggctt taggatcacc tacgatgatc 14880 
tggagggtgg taacattccc gcactgttgg atgtggacgc ctaccaggcg agcttgaaag 14940 
atgacaccga acagggcggg ggtggcgcag gcggcagcaa cagcagtggc agcggcgcgg 15000 
aagagaactc caacgcggca gccgcggcaa tgcagccggt ggaggacatg aacgatcatg 15060 
ccattcgcgg cgacaccttt gccacacggg ctgaggagaa gcgcgctgag gccgaagcag 15120 
cggccgaagc tgccgccccc gctgcgcaac ccgaggtcga gaagcctcag aagaaaccgg 15180 
tgatcaaacc cctgacagag gacagcaaga aacgcagtta caacctaata agcaatgaca 15240 
gcaccttcac ccagtaccgc agctggtacc ttgcatacaa ctacggcgac cctcagaccg 15300 
gaatccgctc atggaccctg ctttgcactc ctgacgtaac ctgcggctcg gagcaggtct 15360 
actggtcgtt gccagacatg atgcaagacc ccgtgacctt ccgctccacg cgccagatca 15420 
gcaactttcc ggtggtgggc gccgagctgt tgcccgtgca ctccaagagc ttctacaacg 15480 
accaggccgt ctactcccaa ctcatccgcc agtttacctc tctgacccac gtgttcaatc 15540 
gctttcccga gaaccagatt ttggcgcgcc cgccagcccc caccatcacc accgtcagtg 15600 
aaaacgttcc tgctctcaca gatcacggga cgctaccgct gcgcaacagc atcggaggag 15660 
tccagcgagt gaccattact gacgccagac gccgcacctg cccctacgtt tacaaggccc 15720 
tgggcatagt ctcgccgcgc gtcctatcga gccgcacttt ttgagcaagc atgtccatcc 15780 
ttatatcgcc cagcaataac acaggctggg gcctgcgctt cccaagcaag atgtttggcg 15840 
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gggccaagaa gcgctccgac caacacccag tgcgcgtgcg cgggcactac cgcgcgccct 15900 

ggggcgcgca caaacgcggc cgcactgggc gcaccaccgt cgatgacgcc atcgacgcgg 15960 

tggtggagga ggcgcgcaac tacacgccca cgccgccacc agtgtccaca gtggacgcgg 16020 

ccattcagac cgtggtgcgc ggagcccggc gctatgctaa aatgaagaga cggcggaggc 16080 

gcgtagcacg tcgccaccgc cgccgacccg gcactgccgc ccaacgcgcg gcggcggccc 16140 

tgcttaaccg cgcacgtcgc accggccgac gggcggccat gcgggccgct cgaaggctgg 16200 

ccgcgggtat tgtcactgtg ccccccaggt ccaggcgacg agcggccgcc gcagcagccg 16260 

cggccattag tgctatgact cagggtcgca ggggcaacgt gtattgggtg cgcgactcgg 16320 

ttagcggcct gcgcgtgccc gtgcgcaccc gccccccgcg caactagatt gcaagaaaaa 16380 

actacttaga ctcgtactgt tgtatgtatc cagcggcggc ggcgcgcaac gaagctatgt 16440 

ccaagcgcaa aatcaaagaa gagatgctcc aggtcatcgc gccggagatc tatggccccc 16500 

cgaagaagga agagcaggat tacaagcccc gaaagctaaa gcgggtcaaa aagaaaaaga 16560 

aagatgatga tgatgaactt gacgacgagg tggaactgct gcacgctacc gcgcccaggc 16620 

gacgggtaca gtggaaaggt cgacgcgtaa aacgtgtttt gcgacccggc accaccgtag 16680 

tctttacgcc cggtgagcgc tccacccgca cctacaagcg cgtgtatgat gaggtgtacg 16740 

gcgacgagga cctgcttgag caggccaacg agcgcctcgg ggagtttgcc tacggaaagc 16800 

ggcataagga catgctggcg ttgccgctgg acgagggcaa cccaacacct agcctaaagc 16860 

ccgtaacact gcagcaggtg ctgcccgcgc ttgcaccgtc cgaagaaaag cgcggcctaa 16920 

agcgcgagtc tggtgacttg gcacccaccg tgcagctgat ggtacccaag cgccagcgac 16980 

tggaagatgt cttggaaaaa atgaccgtgg aacctgggcc ggagcccgag gtccgcgtgc 17040 

ggccaatcaa gcaggtggcg ccgggactgg gcgtgcagac cgtggacgtt cagataccca 17100 

ctaccagtag caccagtatt gccaccgcca cagagggcat ggagacacaa acgtccccgg 17160 

ttgcctcagc ggtggcggat gccgcggtgc aggcggtcgc tgcggccgcg tccaagacct 17220 

ctacggaggt gcaaacggac ccgtggatgt ttcgcgtttc agccccccgg cgcccgcgcg 17280 

gttcgaggaa gtacggcgcc gccagcgcgc tactgcccga atatgcccta catccttcca 17340 

ttgcgcctac ccccggctat cgtggctaca cctaccgccc cagaagacga gcaactaccc 17400 

gacgccgaac caccactgga acccgccgcc gccgtcgccg tcgccagccc gtgctggccc 174 60 

cgatttccgt gcgcagggtg gctcgcgaag gaggcaggac cctggtgctg ccaacagcgc 17520 

gctaccaccc cagcatcgtt raaaagccgg tctttgtggt tcttgcagat atggccctca 17580 

cctgccgcct ccgtttcccg gtgccgggat tccgaggaag aatgcaccgt aggaggggca 17640 

tggccggcca cggcctgacg ggcggcatgc gtcgtgcgca ccaccggcgg cggcgcgcgt 17700 

cgcaccgtcg catgcgcggc ggtatcctgc ccctccttat tccactgatc gccgcggcga 17760 

ttggcgccgt gcccggaatt gcatccgtgg ccttgcaggc gcagagacac tgattaaaaa 17820 

caagttgcat gtggaaaaat caaaataaaa agtctggact ctcacgctcg cttggtcctg 17880 

taactatttt gtagaatgga agacatcaac tttgcgtctc tggccccgcg acacggctcg 17940 

cgcccgttca tgggaaactg gcaagatatc ggcaccagca atatgagcgg tggcgccttc 18000 

agctggggct cgctgtggag cggcattaaa aatttcggtt ccaccgttaa gaactatggc 18060 

agcaaggcct ggaacagcag cacaggccag atgctgaggg ataagttgaa agagcaaaat 18120 

ttccaacaaa aggtggtaga tggcctggcc tctggcatta gcggggtggt ggacctggcc 18180 

aaccaggcag tgcaaaataa gattaacagt aagcttgatc cccgccctcc cgtagaggag 18240 

cctccaccgg ccgtggagac agtgtctcca gaggggcgtg gcgaaaagcg tccgcgcccc 18300 

gacagggaag aaactctggt gacgcaaata gacgagcctc cctcgtacga ggaggcacta 18360 

aagcaaggcc tgcccaccac ccgtcccatc gcgcccatgg ctaccggagt gctgggccag 18420 

cacacacccg taacgctgga cctgcctccc cccgccgaca cccagcagaa acctgtgctg 18480 

ccaggcccga ccgccgttgt tgtaacccgt cctagccgcg : cgtccctgcg ccgcgccgcc 18540 

agcggtccgc gatcgttgcg gcccgtagcc agtggcaact ggcaaagcac actgaacagc 18600 

atcgtgggtc tgggggtgca atccctgaag cgccgacgat gcttctgaat agctaacgtg 18660 

tcgtatgtgt gtcatgtatg cgtccatgtc gccgccagag gagctgctga gccgccgcgc 18720 

gcccgctttc caagatggct accccttcga tgatgccgca gtggtcttac atgcacatct 18780 

cgggccagga cgcctcggag tacctgagcc ccgggctggt gcagtttgcc cgcgccaccg 18840 

agacgtactt cagcctgaat aacaagttta gaaaccccac ggtggcgcct acgcacgacg 18900 

tgaccacaga ccggtcccag cgtttgacgc tgcggttcat ccctgtggac cgtgaggata 18960 

ctgcgtactc gtacaaggcg cggttcaccc tagctgtggg tgataaccgt gtgctggaca 19020 

tggcttccac gtactttgac atccgcggcg tgctggacag gggccctact tttaagccct 19080 

actctggcac tgcctacaac gccctggctc ccaagggtgc cccaaatcct tgcgaatggg 19140 

atgaagctgc tactgctctt gaaataaacc tagaagaaga ggacgatgac aacgaagacg 19200 

aagtagacga gcaagctgag cagcaaaaaa ctcacgtatt tgggcaggcg ccttattctg 19260 

gtataaatat tacaaaggag ggtattcaaa taggtgtcga aggtcaaaca cctaaatatg 19320 

ccgataaaac atttcaacct gaacctcaaa taggagaatc tcagtggtac gaaactgaaa 19380 

ttaatcatgc agctgggaga gtccttaaaa agactacccc aatgaaacca tgttacggtt 19440 

catatgcaaa acccacaaat gaaaatggag ggcaaggcat tcttgtaaag caacaaaatg 19500 

gaaagctaga aagtcaagtg gaaatgcaat ttttctcaac tactgaggcg accgcaggca 19560 

atggtgataa cttgactcct aaagtggtat tgtacagtga agatgtagat atagaaaccc 19620 

cagacactca tatttcttac atgcccacta ttaaggaagg taactcacga gaactaatgg 19680 

gccaacaatc tatgcccaac aggcctaatt acattgcttt tagggacaat tttattggtc 19740 

taatgtatta caacagcacg ggtaatatgg gtgttctggc gggccaagca tcgcagttga 19800 
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atgctgttgt agatttgcaa gacagaaaca cagagctttc ataccagctt ttgcttgatt 19860 
ccattggtga tagaaccagg tacttttcta tgtggaatca ggctgttgac agctatgatc 19920 
cagatgttag aattattgaa aatcatggaa ctgaagatga acttccaaat tactgctttc 19980 
cactgggagg tgtgattaat acagagactc ttaccaaggt aaaacctaaa acaggtcagg 20040 
aaaatggatg ggaaaaagat gctacagaat tttcagataa aaatgaaata agagttggaa 20100 
ataattttgc catggaaatc aatctaaatg ccaacctgtg gagaaatttc ctgtactcca 20160 
acatagcgct gtatttgccc gacaagctaa agtacagtcc ttccaacgta aaaatttctg 20220 
ataacccaaa cacctacgac tacatgaaca agcgagtggt ggctcccggg ttagtggact 20280 
gctacattaa ccttggagca cgctggtccc ttgactatat ggacaacgtc aacccattta 20340 
accaccaccg caatgctggc ctgcgctacc gctcaatgtt gctgggcaat ggtcgctatg 20400 
tgcccttcca catccaggtg cctcagaagt tctttgccat taaaaacctc cttctcctgc 204 60 
cgggctcata cacctacgag tggaacttca ggaaggatgt taacatggtt ctgcagagct 20520 
ccctaggaaa tgacctaagg gttgacggag ccagcattaa gtttgatagc atttgccttt 20580 
acgccacctt cttccccatg gcccacaaca ccgcctccac gcttgaggcc atgcttagaa 20640 
acgacaccaa cgaccagtcc tttaacgact atctctccgc cgccaacatg ctctacccta 20700 
tacccgccaa cgctaccaac gtgcccatat ccatcccctc ccgcaactgg gcggctttcc 20760 
gcggctgggc cttcacgcgc cttaagacta aggaaacccc atcactgggc tcgggctacg 20820 
acccttatta cacctactct ggctctatac cctacccaga tggaaccttt tacctcaacc 20880 
acacctttaa gaaggtggcc attacctttg actcttctgt cagctggcct ggcaatgacc 20940 
gcctgcttac ccccaacgag tttgaaatta agcgctcagt tgacggggag ggttacaacg 21000 
ttgcccagtg taacatgacc aaagactggt tcctggtaca aatgctagct aactacaaca 21060 
ttggctacca gggcttctat atcccagaga gctacaagga ccgcatgtac tccttcttta 21120 
gaaacttcca gcccatgagc cgtcaggtgg tggatgatac taaatacaag gactaccaac 21180 
aggtgggcat cctacaccaa cacaacaact ctggatttgt tggctacctt gcccccacca 21240 
tgcgcgaagg acaggcctac cctgctaact tcccctatcc gcttataggc aagaccgcag 21300 
ttgacagcat tacccagaaa aagtttcttt gcgatcgcac cctttggcgc atcccattct 21360 
ccagtaactt tatgtccatg ggcgcactca cagacctggg ccaaaacctt ctctacgcca 21420 
actccgccca cgcgctagac atgacttttg aggtggatcc catggacgag cccacccttc 21480 
tttatgtttt gtttgaagtc tttgacgtgg tccgtgtgca ccggccgcac cgcggcgtca 21540 
tcgaaaccgt gtacctgcgc acgcccttct cggccggcaa cgccacaaca taaagaagca 21600 
agcaacatca acaacagctg ccgccatggg ctccagtgag caggaactga aagccattgt 21660 
caaagatctt ggttgtgggc catatttttt gggcacctat gacaagcgct ttccaggctt 21720 
tgtttctcca cacaagctcg cctgcgccat agtcaatacg gccggtcgcg agactggggg 21780 
cgtacactgg atggcctttg cctggaaccc gcactcaaaa acatgctacc tctttgagcc 21840 
ctttggcttt tctgaccagc gactcaagca ggtttaccag tttgagtacg agtcactcct 21900 
gcgccgtagc gccattgctt cttcccccga ccgctgtata acgctggaaa agtccaccca 21960 
aagcgtacag gggcccaact cggccgcctg tggactattc tgctgcatgt ttctccacgc 22020 
ctttgccaac tggccccaaa ctcccatgga tcacaacccc accatgaacc ttattaccgg 22080 
ggtacccaac tccatgctca acagtcccca ggtacagccc accctgcgtc gcaaccagga 22140 
acagctctac agcttcctgg agcgccactc gccctacttc cgcagccaca gtgcgcagat 22200 
taggagcgcc acttcttttt gtcacttgaa aaacatgtaa aaataatgta ctagagacac 22260 
tttcaataaa ggcaaatgct tttatttgta cactctcggg tgattattta cccccaccct 22320 
tgccgtctgc gccgtttaaa aatcaaaggg gttctgccgc gcatcgctat gcgccactgg 22380 
cagggacacg ttgcgatact ggtgtttagt gctccactta aactcaggca caaccatccg 22440 
cggcagctcg gtgaagtttt cactccacag gctgcgcacc atcaccaacg cgtttagcag 22500 
gtcgggcgcc gatatcttga agtcgcagtt ggggcctccg ccctgcgcgc gcgagttgcg 22560 
atacacaggg ttgcagcact ggaacactat cagcgccggg tggtgcacgc tggccagcac 22620 
gctcttgtcg gagatcagat ccgcgtccag gtcctccgcg ttgctcaggg cgaacggagt 22680 
caactttggt agctgccttc ccaaaaaggg cgcgtgccca ggctttgagt tgcactcgca 22740 
ccgtagtggc atcaaaaggt gaccgtgccc ggtctgggcg ttaggataca gcgcctgcat 22800 
aaaagccttg atctgcttaa aagccacctg agcctttgcg ccttcagaga agaacatgcc 22860 
gcaagacttg ccggaaaact gattggccgg acaggccgcg tcgtgcacgc agcaccttgc 22920 
gtcggtgttg gagatctgca ccacatttcg gccccaccgg ttcttcacga tcttggcctt 22980 
gctagactgc tccttcagcg cgcgctgccc gttttcgctc gtcacatcca tttcaatcac 23040 
gtgctcctta tttatcataa tgcttc'cgtg tagacactta agctcgcctt cgatctcagc 23100 
gcagcggtgc agccacaacg cgcagcccgt gggctcgtga tgcttgtagg tcacctctgc 23160 
aaacgactgc aggtacgcct gcaggaatcg ccccatcatc gtcacaaagg tcttgttgct 23220 
ggtgaaggtc agctgcaacc cgcggtgctc ctcgttcagc caggtcttgc atacggccgc 23280 
cagagcttcc acttggtcag gcagtagttt gaagttcgcc tttagatcgt tatccacgtg 23340 
gtacttgtcc atcagcgcgc gcgcagcctc catgcccttc tcccacgcag acacgatcgg 23400 
cacactcagc gggttcatca ccgtaatttc actttccgct tcgctgggct cttcctcttc 23460 
ctcttgcgtc cgcataccac gcgccactgg gtcgtcttca ttcagccgcc gcactgtgcg 23520 
cttacctcct ttgccatgct tgattagcac cggtgggttg ctgaaaccca ccatttgtag 23580 
cgccacatct tctctttctt cctcgctgtc cacgattacc tctggtgatg gcgggcgctc 23640 
gggcttggga gaagggcgct tctttttctt cttgggcgca atggccaaat ccgccgccga 23700 
ggtcgatggc cgcgggctgg gtgtgcgcgg caccagcgcg tcttgtgatg agtcttcctc 23760 
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gtcctcggac tcgatacgcc gcctcatccg cttttttggg ggcgcccggg gaggcggcgg 23820 
cgacggggac ggggacgaca cgtcctccat ggttggggga cgtcgcgccg caccgcgtcc 23880 
gcgctcgggg gtggtttcgc gctgctcctc ttcccgactg gccatttcct tctcctatag 23940 
gcagaaaaag atcatggagt cagtcgagaa gaaggacagc ctaaccgccc cctctgagtt 24000 
cgccaccacc gcctccaccg atgccgccaa cgcgcctacc accttccccg tcgaggcacc 24060 
cccgcttgag gaggaggaag tgattatcga gcaggaccca ggttttgtaa gcgaagacga 24120 
cgaggaccgc tcagtaccaa cagaggataa aaagcaagac caggacaacg cagaggcaaa 24180 
cgaggaacaa gtcgggcggg gggacgaaag gcatggcgac tacctagatg tgggagacga 24240 
cgtgctgttg aagcatctgc agcgccagtg cgccattatc tgcgacgcgt tgcaagagcg 24300 
cagcgatgtg cccctcgcca tagcggatgt cagccttgcc tacgaacgcc acctattctc 24360 
accgcgcgta ccccccaaac gccaagaaaa cggcacatgc gagcccaacc cgcgcctcaa 24420 
cttctacccc gtatttgccg tgccagaggt gcttgccacc tatcacatct ttttccaaaa 24480 
ctgcaagata cccctatcct gccgtgccaa ccgcagccga gcggacaagc agctggcctt 24540 
gcggcagggc gctgtcatac ctgatatcgc ctcgctcaac gaagtgccaa aaatctttga 24 600 
gggtcttgga cgcgacgaga agcgcgcggc aaacgctctg caacaggaaa acagcgaaaa 24660 
tgaaagtcac tctggagtgt tggtggaact cgagggtgac aacgcgcgcc tagccgtact 24720 
aaaacgcagc atcgaggtca cccactttgc ctacccggca cttaacctac cccccaaggt 24780 
catgagcaca gtcatgagtg agctgatcgt gcgccgtgcg cagcccctgg agagggatgc 24840 
aaatttgcaa gaacaaacag aggagggcct acccgcagtt ggcgacgagc agctagcgcg 24900 
ctggcttcaa acgcgcgagc ctgccgactt ggaggagcga cgcaaactaa tgatggccgc 24960 
agtgctcgtt accgtggagc ttgagtgcat gcagcggttc tttgctgacc cggagatgca 25020 
gcgcaagcta gaggaaacat tgcactacac ctttcgacag ggctacgtac gccaggcctg 25080 
caagatctcc aacgtggagc tctgcaacct ggtctcctac cttggaattt tgcacgaaaa 25140 
ccgccttggg caaaacgtgc ttcattccac gctcaagggc gaggcgcgcc gcgactacgt 25200 
ccgcgactgc gtttacttat ttctatgcta cacctggcag acggccatgg gcgtttggca 25260 
gcagtgcttg gaggagtgca acctcaagga gctgcagaaa ctgctaaagc aaaacttgaa 25320 
ggacctatgg acggccttca acgagcgctc cgtggccgcg cacctggcgg acatcatttt 25380 
ccccgaacgc ctgcttaaaa ccctgcaaca gggtctgcca gacttcacca gtcaaagcat 2544 0 
gttgcagaac tttaggaact ttatcctaga gcgctcagga atcttgcccg ccacctgctg 25500 
tgcacttcct agcgactttg tgcccattaa gtaccgcgaa tgccctccgc cgctttgggg 25560 
ccactgctac cttctgcagc tagccaacta ccttgcctac cactctgaca taatggaaga 25620 
cgtgagcggt gacggtctac tggagtgtca ctgtcgctgc aacctatgca ccccgcaccg 25680 
ctccctggtt tgcaattcgc agctgcttaa cgaaagtcaa attatcggta cctttgagct 25740 
gcagggtccc tcgcctgacg aaaagtccgc ggctccgggg ttgaaactca ctccggggct 25800 
gtggacgtcg gcttaccttc gcaaatttgt acctgaggac taccacgccc acgagattag 25860 
gttctacgaa gaccaatccc gcccgccaaa tgcggagctt accgcctgcg tcattaccca 25920 
gggccacatt cttggccaat tgcaagccat caacaaagcc cgccaagagt ttctgctacg 25980 
aaagggacgg ggggtttact tggaccccca gtccggcgag gagctcaacc caatcccccc 26040 
gccgccgcag ccctatcagc agcagccgcg ggcccttgct tcccaggatg gcacccaaaa 26100 
agaagctgca gctgccgccg ccacccacgg acgaggagga atactgggac agtcaggcag 26160 
aggaggtttt ggacgaggag gaggaggaca tgatggaaga ctgggagagc ctagacgagg 2 6220 
aagcttccga ggtcgaagag gtgtcagacg aaacaccgtc accctcggtc gcattcccct 2 6280 
cgccggcgcc ccagaaatcg gcaaccggtt ccagcatggc tacaacctcc gctcctcagg 26340 
cgccgccggc actgcccgtt cgccgaccca accgtagatg ggacaccact ggaaccaggg 26400 
ccggtaagtc caagcagccg ccgccgttag cccaagagca acaacagcgc caaggctacc 2 64 60 
gctcatggcg cgggcacaag aacgccatag ttgcttgctt gcaagactgt gggggcaaca 26520 
tctccttcgc ccgccgcttt cttctctacc atcacggcgt ggccttcccc cgtaacatcc 26580 
tgcattacta ccgtcatctc tacagcccat actgcaccgg cggcagcggc agcggcagca 26640 
acagcagcgg ccacacagaa gcaaaggcga ccggatagca agactctgac aaagcccaag 26700 
aaatccacag cggcggcagc agcaggagga ggagcgctgc gtctggcgcc caacgaaccc 26760 
gtatcgaccc gcgagcttag aaacaggatt tttcccactc tgtatgctat atttcaacag 26820 
agcaggggcc aagaacaaga gctgaaaata aaaaacaggt ctctgcgatc cctcacccgc 26880 
agctgcctgt atcacaaaag cgaagatcag cttcggcgca cgctggaaga cgcggaggct 26940 
ctcttcagta aatactgcgc gctgactctt aaggactagt ttcgcgccct ttctcaaatt 27000 
taagcgcgaa aactacgtca tctccagcgg ccacacccgg cgccagcacc tgtcgtcagc 27060 
gccattatga gcaaggaaat tcccacgccc tacatgtgga gttaccagcc acaaatggga 27120 
cttgcggctg gagctgccca agactactca acccgaataa actacatgag cgcgggaccc 27180 
cacatgatat cccgggtcaa cggaatccgc gcccaccgaa accgaattct cttggaacag 27240 
gcggctatta ccaccacacc tcgtaataac cttaatcccc gtagttggcc cgctgccctg 27300 
gtgtaccagg aaagtcccgc tcccaccact gtggtacttc ccagagacgc ccaggccgaa 27360 
gttcagatga ctaactcagg ggcgcagctt gcgggcggct ttcgtcacag ggtgcggtcg 27420 
cccgggcagg gtataactca cctgacaatc agagggcgag gtattcagct caacgacgag 27480 
tcggtgagct cctcgcttgg tctccgtccg gacgggacat ttcagatcgg cggcgccggc 27540 
cgtccttcat tcacgcctcg tcaggcaatc ctaactctgc agacctcgtc ctctgagccg 27 600 
cgctctggag gcattggaac tctgcaattt attgaggagt ttgtgccatc ggtctacttt 27660 
aaccccttct cgggacctcc cggccactat ccggatcaat ttattcctaa ctttgacgcg 27720 
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gtaaaggact cggcggacgg ctacgactga atgttaagtg gagaggcaga gcaactgcgc 27780 
ctgaaacacc tggtccactg tcgccgccac aagtgctttg cccgcgactc cggtgagttt 27840 
tgctactttg aattgcccga ggatcatatc gagggcccgg cgcacggcgt ccggcttacc 27900 
gcccagggag agcttgcccg tagcctgatt cgggagttta cccagcgccc cctgctagtt 27960 
gagcgggaca ggggaccctg tgttctcact gtgatttgca actgtcctaa ccttggatta 28020 
catcaagatc tttgttgcca tctctgtgct gagtataata aatacagaaa ttaaaatata 28080 
ctggggctcc tatcgccatc ctgtaaacgc caccgtcttc acccgcccaa gcaaaccaag 28140 
gcgaacctta cctggtactt ttaacatctc tccctctgtg atttacaaca gtttcaaccc 28200 
agacggagtg agtctacgag agaacctctc cgagctcagc tactccatca gaaaaaacac 28260 
caccctcctt acctgccggg aacgtacgag tgcgtcaccg gccgctgcac cacacctacc 28320 
gcctgaccgt aaaccagact ttttccggac agacctcaat aactctgttt accagaacag 28380 
gaggtgagct tagaaaaccc ttagggtatt aggccaaagg cgcagctact gtggggttta 28440 
tgaacaattc aagcaactct acgggctatt ctaattcagg tttctctaga agtcaggctt 28500 
cctggatgtc agcatctgac tttggccagc acctgtcccg cggatttgtt ccagtccaac 28560 
tacagcgacc caccctaaca gagatgacca acacaaccaa cgcggccgcc gctaccggac 28620 
ttacatctac cacaaataca ccccaagttt ctgcctttgt caataactgg gataacttgg 28680 
gcatgtggtg gttctccata gcgcttatgt ttgtatgcct tattattatg tggctcatct 28740 
gctgcctaaa gcgcaaacgc gcccgaccac ccatctatag tcccatcatt gtgctacacc 28800 
caaacaatga tggaatccat agattggacg gactgaaaca catgttcttt tctcttacag 28860 
tatgattaaa tgagatctag aaatggacgg aattattaca gagcagcgcc tgctagaaag 28920 
acgcagggca gcggccgagc aacagcgcat gaatcaagag ctccaagaca tggttaactt 28980 
gcaccagtgc aaaaggggta tcttttgtct ggtaaagcag gccaaagtca cctacgacag 29040 
taataccacc ggacaccgcc ttagctacaa gttgccaacc aagcgtcaga aattggtggt 29100 
catggtggga gaaaagccca ttaccataac tcagcactcg gtagaaaccg aaggctgcat 29160 
tcactcacct tgtcaaggac ctgaggatct ctgcaccctt attaagaccc tgtgcggtct 29220 
caaagatctt attcccttta actaataaaa aaaaataata aagcatcact tacttaaaat 29280 
cagttagcaa atttctgtcc agtttattca gcagcacctc cttgccctcc tcccagctct 29340 
ggtattgcag cttcctcctg gctgcaaact ttctccacaa tctaaatgga atgtcagttt 29400 
cctcctgttc ctgtccatcc gcacccacta tcttcatgtt gttgcagatg aagcgcgcaa 294 60 
gaccgtctga agataccttc aaccccgtgt atccatatga cacggaaacc ggtcctccaa 29520 
ctgtgccttt tcttactcct ccctttgtat cccccaatgg gtttcaagag agtccccctg 29580 
gggtactctc tttgcgccta tccgaacctc tagttacctc caatggcatg cttgcgctca 29640 
aaatgggcaa cggcctctct ctggacgagg ccggcaacct tacctcccaa aatgtaacca 29700 
ctgtgagccc acctctcaaa aaaaccaagt caaacataaa cctggaaata tctgcacccc 29760 
tcacagttac ctcagaagcc ctaactgtgg ctgccgccgc acctctaatg gtcgcgggca 29820 
acacactcac catgcaatca caggccccgc taaccgtgca cgactccaaa cttagcattg 29880 
ccacccaagg acccctcaca gtgtcagaag gaaagctagc cctgcaaaca tcaggccccc 29940 
tcaccaccac cgatagcagt acccttacta tcactgcctc accccctcta actactgcca 30000 
ctggtagctt gggcattgac ttgaaagagc ccatttatac acaaaatgga aaactaggac 30060 
taaagtacgg ggctcctttg catgtaacag acgacctaaa cactttgacc gtagcaactg 30120 
gtccaggtgt gactattaat aatacttcct tgcaaactaa agttactgga gccttgggtt 30180 
ttgattcaca aggcaatatg caacttaatg tagcaggagg actaaggatt gattctcaaa 30240 
acagacgcct tatacttgat gttagttatc cgtttgatgc tcaaaaccaa ctaaatctaa 30300 
gactaggaca gggccctctt tttataaact cagcccacaa cttggatatt aactacaaca 30360 
aaggccttta cttgtttaca gcttcaaaca attccaaaaa gcttgaggtt aacctaagca 30420 
ctgccaaggg gttgatgttt gacgctacag ccatagccat taatgcagga gatgggcttg 30480 
aatttggttc acctaatgca ccaaacacaa atcccctcaa aacaaaaatt ggccatggcc 30540 
tagaatttga ttcaaacaag gctatggttc ctaaactagg aactggcctt agttttgaca 30600 
gcacaggtgc cattacagta ggaaacaaaa ataatgataa gctaactttg tggaccacac 30660 
cagctccatc tcctaactgt agactaaatg cagagaaaga tgctaaactc actttggtct 30720 
taacaaaatg tggcagtcaa atacttgcta cagtttcagt tttggctgtt aaaggcagtt 30780 
tggctccaat atctggaaca gttcaaagtg ctcatcttat tataagattt gacgaaaatg 30840 
gagtgctact aaacaattcc ttcctggacc cagaatattg gaactttaga aatggagatc 30900 
ttactgaagg cacagcctat acaaacgctg ttggatttat gcctaaccta tcagcttatc 30960 
caaaatctca cggtaaaact gccaaaagta acattgtcag tcaagtttac ttaaacggag 31020 
acaaaactaa acctgtaaca ctaaccatta cactaaacgg tacacaggaa acaggagaca 31080 
caactccaag tgcatactct atgtcatttt catgggactg gtctggccac aactacatta 31140 
atgaaatatt tgccacatcc tcttacactt tttcatacat tgcccaagaa taaagaatcg 31200 
tttgtgttat gtttcaacgt gtttattttt caattgcaga aaatttcaag tcatttttca 31260 
ttcagtagta tagccccacc accacatagc ttatacagat caccgtacct taatcaaact 31320 
cacagaaccc tagtattcaa cctgccacct ccctcccaac acacagagta cacagtcctt 31380 
tctccccggc tggccttaaa aagcatcata tcatgggtaa cagacatatt cttaggtgtt 314 40 
atattccaca cggtttcctg tcgagccaaa cgctcatcag tgatattaat aaactccccg 31500 
ggcagctcac ttaagttcat gtcgctgtcc agctgctgag ccacaggctg ctgtccaact 31560 
tgcggttgct taacgggcgg cgaaggagaa gtccacgcct acatgggggt agagtcataa 31620 
tcgtgcatca ggatagggcg gtggtgctgc agcagcgcgc gaataaactg ctgccgccgc 31680 
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cgctccgtcc 


tgcaggaata 


caacatggca 


gtggtctcct 


cagcgatgat 


tcgcaccgcc 


31740 


cgcagcataa 


ggcgccttgt 


cctccgggca 


cagcagcgca 


ccctgatctc 


acttaaatca 


31800 


gcacagtaac 


tgcagcacag 


caccacaata 


ttgttcaaaa 


tcccacagtg 


caaggcgctg 


31860 


tatccaaagc 


tcatggcggg 


gaccacagaa 


cccacgtggc 


•catcatacca 


caagcgcagg 


31920 


tagattaagt 


ggcgacccct 


cataaacacg 


ctggacataa 


acattacctc 


ttttggcatg 


31980 


ttgtaattca 


ccacctcccg 


gtaccatata 


aacctctgat 


taaacatggc 


gccatccacc 


32040 


accatcctaa 


accagctggc 


caaaacctgc 


ccgccggcta 


tacactgcag 


ggaaccggga 


32100 


ctggaacaat 


gacagtggag 


agcccaggac 


tcgtaaccat 


ggatcatcat 


gctcgtcatg 


32160 


atatcaatgt 


tggcacaaca 


caggcacacg 


tgcatacact 


tcctcaggat 


tacaagctcc 


32220 


tcccgcgtta 


gaaccatatc 


ccagggaaca 


acccattcct 


gaatcagcgt 


aaatcccaca 


32280 


ctgcagggaa 


gacctcgcac 


gtaactcacg 


ttgtgcattg 


tcaaagtgtt 


acattcgggc 


32340 


agcagcggat 


gatcctccag 


tatggtagcg 


cgggtttctg 


tctcaaaagg 


aggtagacga 


32400 


tccctactgt 


acggagtgcg 


ccgagacaac 


cgagatcgtg 


ttggtcgtag 


tgtcatgcca 


32460 


aatggaacgc 


cggacgtagt 


catatttcct 


gaagcaaaac 


caggtgcggg 


cgtgacaaac 


32520 


agatctgcgt 


ctccggtctc 


gccgcttaga 


tcgctctgtg 


tagtagttgt 


agtatatcca 


32580 


ctctctcaaa 


gcatccaggc 


gccccctggc 


ttcgggttct 


atgtaaactc 


cttcatgcgc 


32640 


cgctgccctg 


ataacatcca 


ccaccgcaga 




cccagccaac 


ctacacattc 


32700 


gttctgcgag 


tcacacacgg 


gaggagcggg 


aagagctgga 


agaaccatgt 


tttttttttt 


32760 


attccaaaag 


attatccaaa 


acctcaaaat 


gaagatctat 


taagtgaacg 


cgctcccctc 


32820 


cggtggcgtg 


gtcaaactct 


acagccaaag 


aacagataat 


ggcatttgta 


agatgttgca 


32880 


caatggcttc 


caaaaggcaa 


acggccctca 


cgtccaagtg 


gacgtaaagg 


ctaaaccctt 


32940 


cagggtgaat 


ctcctctata 


aacattccag 


caccttcaac 


catgcccaaa 


taattctcat 


33000 


ctcgccacct 


tctcaatata 


tctctaagca 


aatcccgaat 


attaagtccg 


gccattgtaa 


33060 


aaatctgctc 


cagagcgccc 




gcctcaagca 


gcgaatcatg 


attgcaaaaa 


33120 


ttcaggttcc 


tcacagacct 


gtataagatt 


caaaagcgga 


acattaacaa 


aaataccgcg 


33180 


atcccgtagg 


tcccttcgca 


gggccagctg aacataatcg 


tgcaggtctg 


cacggaccag 


33240 


cgcggccact 


tccccgccag 


gaaccttgac 


aaaagaaccc 


acactgatta 


tgacacgcat 


33300 


actcggagct 


atgctaacca 


gcgtagcccc gatgtaagct 


ttgttgcatg 


ggcggcgata 


33360 


taaaatgcaa 


ggtgctgctc 


aaaaaatcag gcaaagcctc 


gcgcaaaaaa 


gaaagcacat 


33420 


cgtagtcatg 


ctcatgcaga 


taaaggcagg 


taagctccgg 


aaccaccaca 


gaaaaagaca 


33480 


ccatttttct 


ctcaaacatg 


tctgcgggtt 


tctgcataaa 


cacaaaataa 


aataacaaaa 


33540 


aaacatttaa 


acattagaag 


cctgtcttac 




acaaccctta 


taagcataag 


33600 


acggactacg 


gccatgccgg 


cgtgaccgta 


aaaaaactgg 


tcaccgtgat 


taaaaagcac 


33660 


caccgacagc 


tcctcggtca 


tgtccggagt 




gactcggtaa 


acacatcagg 


33720 


ttgattcatc 


ggtcagtgct 


aaaaagcgac 


cgaaatagcc 


cgggggaata 


catacccgca 


33780 


ggcgtagaga 




gcccccatag gaggtataac 




ggagagaaaa 


33840 


acacataaac 


acctgaaaaa 


ccctcctgcc 


taggcaaaat 


agcaccctcc 


cgctccagaa 


33900 




cgcttcacag 


cggcagccta 


acagtcagcc 


ttaccagtaa 


aaaagaaaac 


33960 




aacaccactc 


gacacggcac 




agtcacagtg 


taaaaaaggg 


34020 


ccaagtgcag 


agcgagtata 


tataggacta 


aaaaatgacg 


taacggttaa 


agtccacaaa 


34080 


aaacacccag 


aaaaccgcac 


gcgaacctac 


gcccagaaac 


gaaagccaaa 


aaacccacaa 


34140 


cttcctcaaa 


tcgtcacttc 


cgttttccca cgttacgtaa 




taagaaaact 


34200 


acaattccca 




gttactccgc 


cctaaaacct 


acgtcacccg 


ccccgttccc 


34260 


acgccccgcg 


ccacgtcaca 


aactccaccc 


cctcattatc 


atattggctt 


caatccaaaa 


34320 


taaggtatat 


tattgatgat 


g 








34341 
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catcatcaat 


aatatacctt 


attttggatt 


gaagccaata 


tgataatgag 


ggggtggagt 


60 


ttgtgacgtg 


gcgcggggcg 


tgggaacggg 


gcgggtgacg 


tagtagtgtg 


gcggaagtgt 


120 


gatgttgcaa 


gtgtggcgga 


acacatgtaa gcgacggatg 


tggcaaaagt 


gacgtttttg 


180 


gtgtgcgccg 


gtgtacacag 


gaagtgacaa ttttcgcgcg 


gttttaggcg 


gatgttgtag 


240 


taaatttggg 


cgtaaccgag 


taagatttgg ccattttcgc 


gggaaaactg 


aataagagga 


300 


agtgaaatct 


gaataatttt 


gtgttactca 


tagcgcgtaa 


tatttgtcta 


gggccgcggg 




gactttgacc 


gtttacgtgg 


agactcgccc 


aggtgttttt 


ctcaggtgtt 


ttccgcgttc 


420 


cgggtcaaag 


ttggcgtttt 


attattatag 


tcagctgacg 


tgtagtgtat 


ttatacccgg 


480 


tgagttcctc 


aagaggccac 


tcttgagtgc 


cagcgagtag 


agttttctcc 


tccgagccgc 


540 


tccgacaccg 


ggactgaaaa 


tgagacatat 


tatctgccac 


ggaggtgtta 


ttaccgaaga 


600 


aatggccgcc 


agtcttttgg 


accagctgat 


cgaagaggta 


ctggctgata 


atcttccacc 


660 


tcctagccat 


tttgaaccac 




cgaactgtat 


gatttagacg 


tgacggcccc 


720 


cgaagatccc 


aacgaggagg 


cggtttcgca 


gatttttccc 


gactctgtaa 


tgttggcggt 


780 


gcaggaaggg 


attgacttac 


tcacttttcc 


gccggcgccc 


ggttctccgg 


agccgcctca 


840 



Copied from 1020674? on 07-04-2004 



PCT/US00/1897I 



cctttcccgg cagcccgagc agccggagca gagagccttg ggtccggttt ctatgccaaa 900 
ccttgtaccg gaggtgatcg atcttacctg ccacgaggct ggctttccac ccagtgacga 960 
cgaggatgaa gagggtgagg agtttgtgtt agattatgtg gagcaccccg ggcacggttg 1020 
caggtcttgt cattatcacc ggaggaatac gggggaccca gatattatgt gttcgctttg 1080 
ctatatgagg acctgtggca tgtttgtcta cagtaagtga aaattatggg cagtgggtga 1140 
tagagtggtg ggtttggtgt ggtaattttt tttttaattt ttacagtttt gtggtttaaa 1200 
gaattttgta ttgtgatttt tttaaaaggt cctgtgtctg aacctgagcc tgagcccgag 1260 
ccagaaccgg agcctgcaag acctacccgc cgtcctaaaa tggcgcctgc tatcctgaga 1320 
cgcccgacat cacctgtgtc tagagaatgc aatagtagta cggatagctg tgactccggt 1380 
ccttctaaca cacctcctga gatacacccg gtggtcccgc tgtgccccat taaaccagtt 1440 
gccgtgagag ttggtgggcg tcgccaggct gtggaatgta tcgaggactt gcttaacgag 1500 
cctgggcaac ctttggactt gagctgtaaa cgccccaggc cataaggtgt aaacctgtga 1560 
ttgcgtgtgt ggttaacgcc tttgtttgct gaatgagttg atgtaagttt aataaagggt 1620 
gagataatgt ttaacttgca tggcgtgtta aatggggcgg ggcttaaagg gtatataatg 1680 
cgccgtgggc taatcttggt tacatctgac ctcatggagg cttgggagtg tttggaagat 1740 
ttttctgctg tgcgtaactt gctggaacag agctctaaca gtacctcttg gttttggagg 1800 
tttctgtggg gctcatccca ggcaaagtta gtctgcagaa ttaaggagga ttacaagtgg 1860 
gaatttgaag agcttttgaa atcctgtggt gagctgtttg attctttgaa tctgggtcac 1920 
caggcgcttt tccaagagaa ggtcatcaag actttggatt tttccacacc ggggcgcgct 1980 
gcggctgctg ttgctttttt gagttttata aaggataaat ggagcgaaga aacccatctg 2040 
agcggggggt acctgctgga ttttctggcc atgcatctgt ggagagcggt tgtgagacac 2100 
aagaatcgcc tgctactgtt gtcttccgtc cgcccggcga taataccgac ggaggagcag 2160 
cagcagcagc aggaggaagc caggcggcgg cggcaggagc agagcccatg gaacccgaga 2220 
gccggcctgg accctcggga atgaatgttg tacaggtggc tgaactgtat ccagaactga 2280 
gacgcatttt gacaattaca gaggatgggc aggggctaaa gggggtaaag agggagcggg 234 0 
'gggcttgtga ggctacagag gaggctagga atctagcttt tagcttaatg accagacacc 2400 
gtcctgagtg tattactttt caacagatca aggataattg cgctaatgag cttgatctgc 2460 
tggcgcagaa gtattccata gagcagctga ccacttactg gctgcagcca ggggatgatt 2520 
ttgaggaggc tattagggta tatgcaaagg tggcacttag gccagattgc aagtacaaga 2580 
tcagcaaact tgtaaatatc aggaattgtt gctacatttc tgggaacggg gccgaggtgg 2640 
agatagatac ggaggatagg gtggccttta gatgtagcat gataaatatg tggccggggg 2700 
tgcttggcat ggacggggtg gttattatga atgtaaggtt tactggcccc aattttagcg 2760 
gtacggtttt cctggccaat accaacctta tcctacacgg tgtaagcttc tatgggttta 2820 
acaa'tacctg tgtggaagcc tggaccgatg taagggttcg gggctgtgcc ttttactgct 2880 
gctggaaggg ggtggtgtgt cgccccaaaa gcagggcttc aattaagaaa tgcctctttg 2940 
aaaggtgtac cttgggtatc ctgtctgagg gtaactccag ggtgcgccac aatgtggcct 3000 
ccgactgtgg ttgcttcatg ctagtgaaaa gcgtggctgt gattaagcat aacatggtat 3060 
gtggcaactg cgaggacagg gcctctcaga tgctgacctg ctcggacggc aactgtcacc 3120 
tgctgaagac cattcacgta gccagccact ctcgcaaggc ctggccagtg tttgagcata 3180 
acatactgac ccgctgttcc ttgcatttgg gtaacaggag gggggtgttc ctaccttacc 3240 
aatgcaattt gagtcacact aagatattgc ttgagcccga gagcatgtcc aaggtgaacc 3300 
tgaacggggt gtttgacatg accatgaaga tctggaaggt gctgaggtac gatgagaccc 3360 
gcaccaggtg cagaccctgc gagtgtggcg gtaaacatat taggaaccag cctgtgatgc 3420 
tggatgtgac cgaggagctg aggcccgatc acttggtgct ggcctgcacc cgcgctgagt 3480 
ttggctctag cgatgaagat acagattgag gtactgaaat gtgtgggcgt ggcttaaggg 3540 
tgggaaagaa tatataaggt gggggtctta tgtagttttg tatctgtttt gcagcagccg 3600 
ccgccgccat gagcaccaac tcgtttgatg gaagcattgt gagctcatat ttgacaacgc 3660 
gcatgccccc atgggccggg gtgcgtcaga atgtgatggg ctccagcatt gatggtcgcc 3720 
ccgtcctgcc cgcaaactct actaccttga cctacgagac cgtgtctgga acgccgttgg 3780 
agactgcagc ctccgccgcc gcttcagccg ctgcagccac cgcccgcggg attgtgactg 3840 
actttgcttt cctgagcccg cttgcaagca gtgcagcttc ccgttcatcc gcccgcgatg 3900 
acaagttgac ggctcttttg gcacaattgg attctttgac ccgggaactt aatgtcgttt 3960 
ctcagcagct gttggatctg cgccagcagg tttctgccct gaaggcttcc tcccctccca 4020 
atgcggttta aaacataaat aaaaaaccag actctgtttg gatttggatc aagcaagtgt 4080 
cttgctgtct ttatttaggg gttttgcgcg cgcggtaggc ccgggaccag cggtctcggt 4140 
cgttgagggt cctgtgtatt ttttccagga cgtggtaaag gtgactctgg atgttcagat 4200 
acatgggcat aagcccgtct ctggggtgga ggtagcacca ctgcagagct tcatgctgcg 4260 
gggtggtgtt gtagatgatc cagtcgtagc aggagcgctg ggcgtggtgc ctaaaaatgt 4 320 
ctttcagtag caagctgatt gccaggggca ggcccttggt gtaagtgttt acaaagcggt 4 380 
taagctggga tgggtgcata cgtggggata tgagatgcat cttggactgt atttttaggt 4440 
tggctatgtt cccagccata tccctccggg gattcatgtt gtgcagaacc accagcacag 4500 
tgtatccggt gcacttggga aatttgtcat gtagcttaga aggaaatgcg tggaagaact 4560 
tggagacgcc cttgtgacct ccaagatttt ccatgcattc gtccataatg atggcaatgg 4620 
gcccacgggc ggcggcctgg gcgaagatat ttctgggatc actaacgtca tagttgtgtt 4680 
ccaggatgag atcgtcatag gccattttta caaagcgcgg gcggagggtg ccagactgcg 4740 
gtataatggt tccatccggc ccaggggcgt agttaccctc acagatttgc atttcccacg 4800 
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ctttgagttc agatgggggg atcatgtcta 
gggtagggga gatcagctgg gaagaaagca 
cggtgggccc gtaaatcaca cctattaccg 
tgccgtcatc cctgagcagg ggggccactt 
ccctgaccaa atccgccaga aggcgctcgc 
caaagttttt caacggtttg agaccgtccg 
gcagttccag gcggtcccac agctcggtca 
ctcctcgttt cgcgggttgg ggcggctttc 
acgggccagg gtcatgtctt tccacgggcg 
ggtgaagggg tgcgctccgg gctgcgcgct 
ggtgctgaag cgctgccggt cttcgccctg 
gtcatagtcc agcccctccg cggcgtggcc 
gccgcacgag gggcagtgca gacttttgag 
ttccggggag taggcatccg cgccgcaggc 
ggtgagctct ggccgttcgg ggtcaaaaac 
cttacctctg gtttccatga gccggtgtcc 
cccgtataca gacttgagag gcctgtcctc 
aaactcggac cactctgaga caaaggctcg 
ggaggggtag cggtcgttgt ccactagggg 
gtcgccctct tcggcatcaa ggaaggtgat 
tgttcctgaa ggggggctat aaaagggggt 
atcgctgtct gcgagggcca gctgttgggg 
ttctgcgcta agattgtcag tttccaaaaa 
ggtgatgcct ttgagggtgg ccgcatccat 
aagcttggtg gcaaacgacc cgtagagggc 
ggtttggttt ttgtcgcgat cggcgcgctc 
gcgcgcaacg caccgccatt cgggaaagac 
gcgccaaccg cggttgtgca gggtgacaag 
gcgctcgttg gtccagcaga ggcggccgcc 
tagctgcgtc tcgtccgggg ggtctgcgtc 
gtcgaagtag tctatcttgc atccttgcaa 
aagcgcgcgc tcgtatgggt tgagtggggg 
ggcgtacatg ccgcaaatgt cgtaaacgta 
agggtagcat cttccaccgc ggatgctggc 
agcgaggagg tcgggaccga ggttgctacg 
cctgaagatg gcatgtgagt tggatgatat 
gtctgtgaga cctaccgcgt cacgcacgaa 
cagctcggcg gtgacctgca cgtctagggc 
atacttatcc tgtccctttt ttttccacag 
tttccagtac tcttggatcg gaaacccgtc 
gaactggttg acggcctggt aggcgcagca 
cgcggccttc cggagcgagg tgtgggtgag 
gtactggtat ttgaagtcag tgtcgtcgca 
gcgctttttg gaacgcggat ttggcagggc 
cgcgcgaggc ataaagttgc gtgtgatgcg 
aattacctgg gcggcgagca cgatctcgtc 
aagttccaag aagcgcggga tgcccttgat 
gagctcttca ggggagctga gcccgtgctc 
ggaagcgacg aatgagctcc acaggtcacg 
ggtcctaaac tggcgaccta tggccatttt 
gtcttgttcc cagcggtccc atccaaggtt 
aggctcatct ccgccgaact tcatgaccag 
ccccatccaa gtataggtct ctacatcgta 
cgagccgatc gggaagaact ggatctcccg 
gtgaaagtag aagtccctgc gacgggccga 
gcagtactgg cagcggtgca cgggctgtac 
cacaaggaag cagagtggga atttgagccc 
tacttcggct gcttgtcctt gaccgtctgg 
caccacgccg cgcgagccca aagtccagat 
aacatcgcgc agatgggagc tgtccatggt 
gagctcctgc aggtttacct cgcatagacg 
cctaatttcc aggggctggt tggtggcggc 
cggcgcgact acggtaccgc gcggcgggcg 
atctaaaagc ggtgacgcgg gcgagccccc 
agagggggca ggggcacgtc ggcgccgcgc 
ttgctggcga acgcgacgac gcggcggttg 



cctgcggggc gatgaagaaa acggtttccg 4860 
ggttcctgag cagctgcgac ttaccgcagc 4 920 
ggtgcaactg gtagttaaga gagctgcagc 4 980 
cgttaagcat gtccctgact cgcatgtttt 5040 
cgcccagcga tagcagttct tgcaaggaag 5100 
ccgtaggcat gcttttgagc gtttgaccaa 5160 
cctgctctac ggcatctcga tccagcatat 5220 
gctgtacggc agtagtcggt gctcgtccag 5280 
cagggtcctc gtcagcgtag tctgggtcac 5340 
ggccagggtg cgcttgaggc tggtcctgct 5400 
cgcgtcggcc aggtagcatt tgaccatggt 5460 
cttggcgcgc agcttgccct tggaggaggc 5520 
ggcgtagagc ttgggcgcga gaaataccga 5580 
cccgcagacg gtctcgcatt ccacgagcca 5640 
caggtttccc ccatgctttt tgatgcgttt 5700 
acgctcggtg acgaaaaggc tgtccgtgtc 5760 
gagcggtgtt ccgcggtcct cctcgtatag 5820 
cgtccaggcc agcacgaagg aggctaagtg 5880 
gtccactcgc tccagggtgt gaagacacat 5940 
tggtttgtag gtgtaggcca cgtgaccggg 6000 
gggggcgcgt tcgtcctcac tctcttccgc 6060 
tgagtactcc ctctgaaaag cgggcatgac 6120 
cgaggaggat ttgatattca cctggcccgc 6180 
ctggtcagaa aagacaatct ttttgttgtc 6240 
gttggacagc aacttggcga tggagcgcag 6300 
cttggccgcg atgtttagct gcacgtattc 6360 
ggtggtgcgc tcgtcgggca ccaggtgcac 6420 
gtcaacgctg gtggctacct ctccgcgtag 6480 
cttgcgcgag cagaatggcg gtagggggtc 6540 
cacggtaaag accccgggca gcaggcgcgc 6600 
gtctagcgcc tgctgccatg cgcgggcggc 6660 
accccatggc atggggtggg tgagcgcgga 6720 
gaggggctct ctgagtattc caagatatgt 6780 
gcgcacgtaa tcgtatagtt cgtgcgaggg 6840 
ggcgggctgc tctgctcgga agactatctg 6900 
ggttggacgc tggaagacgt tgaagctggc 6960 
ggaggcgtag gagtcgcgca gcttgttgac 7020 
gcagtagtcc agggtttcct tgatgatgtc 7080 
ctcgcggttg aggacaaact cttcgcggtc 7140 
ggcctccgaa cggtaagagc ctagcatgta 7200 
tcccttttct acgggtagcg cgtatgcctg 7260 
cgcaaaggtg tccctgacca tgactttgag 7320 
tccgccctgc tcccagagca aaaagtccgt 7380 
gaaggtgaca tcgttgaaga gtatctttcc 7440 
gaagggtccc ggcacctcgg aacggttgtt 7500 
aaagccgttg atgttgtggc ccacaatgta 7560 
ggaaggcaat tttttaagtt cctcgtaggt 7 620 
tgaaagggcc cagtctgcaa gatgagggtt 7 680 
ggccattagc atttgcaggt ggtcgcgaaa 7740 
ttctggggtg atgcagtaga aggtaagcgg 7800 
cgcggctagg tctcgcgcgg cagtcactag 7860 
catgaagggc acgagctgct tcccaaaggc 7920 
ggtgacaaag agacgctcgg tgcgaggatg 7 980 
ccaccaattg gaggagtggc tattgatgtg 8040 
acactcgtgc tggcttttgt aaaaacgtgc 8100 
atcctgcacg aggttgacct gacgaccgcg 8160 
ctcgcctggc gggtttggct ggtggtcttc 8220 
ctgctcgagg ggagttacgg tggatcggac 8280 
gtccgcgcgc ggcggtcgga gcttgatgac 8340 
ctggagctcc cgcggcgtca ggtcaggcgg 8400 
ggtcagggcg cgggctagat ccaggtgata 84 60 
gtcgatggct tgcaagaggc cgcatccccg 8520 
gtgggccgcg ggggtgtcct tggatgatgc 8580 
ggaggtaggg ggggctccgg acccgccggg 8640 
gcgggcagga gctggtgctg cgcgcgtagg 8700 
atctcctgaa tctggcgcct ctgcgtgaag 8760 
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acgacgggcc cggtgagctt gagcctgaaa 
ttgacggcgg cctggcgcaa aatctcctgc 
tcggccatga actgctcgat ctcttcctcc 
gtggcggcga ggtcgttgga aatgcgggcc 
tcgttccaga cgcggctgta gaccacgccc 
tgcgcgagat tgagctccac gtgccgggcg 
aggtagttga gggtggtggc ggtgtgttct 
aacgtggatt cgttgatatc ccccaaggcc 
acggcgaagt tgaaaaactg ggagttgcgc 
cggatgagct cggcgacagt gtcgcgcacc 
tcttcttcaa tctcctcttc cataagggcc 
ggagggggga cacggcggcg acgacggcgc 
atctccccgc ggcgacggcg catggtctcg 
agttggaaga cgccgcccgt catgtcccgg 
agggatacgg cgctaacgat gcatctcaac 
gacctgagcg agtccgcatc gaccggatcg 
tcacagtcgc aaggtaggct gagcaccgtg 
tttctggcgg aggtgctgct gatgatgtaa 
gtcgacagaa gcaccatgtc cttgggtccg 
ccccaggctt cgttttgaca tcggcgcagg 
accggcactt cttcttctcc ttcctcttgt 
gcggcggagt ttggccgtag gtggcgccct 
ctcatcggct gaagcagggc taggtcggcg 
acctgcgtga gggtagactg gaagtcatcc 
ttgatggtgt aagtgcagtt ggccataacg 
gagagctcgg tgtacctgag acgcgagtaa 
gtccgcacca ggtactggta tcccaccaaa 
cagcgtaggg tggccggggc tccgggggcg 
tagatgtacc tggacatcca ggtgatgccg 
cggacgcggt tccagatgtt gcgcagcggc 
ccggtcaggc gcgcgcaatc gttgacgctc 
ggcactcttc cgtggtctgg tggataaatt 
tcgagccccg tatccggccg tccgccgtga 
caggtgtgcg acgtcagaca acgggggagt 
gctgctgcgc tagctttttt ggccactggc 
gcgaaagcat taagtggctc gctccctgta 
gcgggacccc cggttcgagt ctcggaccgg 
ccgtcatgca agaccccgct tgcaaattcc 
ttttcccaga tgcatccggt gctgcggcag 
caagagcagc ggcagacatg cagggcaccc 
acatccgcgg ttgacgcggc agcagatggt 
cactacctgg acttggagga gggcgagggc 
cggtacccaa gggtgcagct gaagcgtgat 
ctgtttcgcg accgcgaggg agaggagccc 
gggcgcgagc tgcggcatgg cctgaatcgc 
cccgacgcgc gaaccgggat tagtcccgcg 
accgcatacg agcagacggt gaaccaggag 
gtgcgtacgc ttgtggcgcg cgaggaggtg 
gtaagcgcgc tggagcaaaa cccaaatagc 
gtgcagcaca gcagggacaa cgaggcattc 
gagggccgct ggctgctcga tttgataaac 
agcttgagcc tggctgacaa ggtggccgcc 
ttttacgccc gcaagatata ccatacccct 
gaggggttct acatgcgcat ggcgctgaag 
tatcgcaacg agcgcatcca caaggccgtg 
cgcgagctga tgcacagcct gcaaagggcc 
gccgagtcct actttgacgc gggcgctgac 
gaggcagctg gggccggacc tgggctggcg 
ggcgtggagg aatatgacga ggacgatgag 
gtgatgtttc tgatcagatg atgcaagacg 
agagccagcc gtccggcctt aactccacgg 
tgtcgctgac tgcgcgcaat cctgacgcgt 
ccgcaattct ggaagcggtg gtcccggcgc 
cgatcgtaaa cgcgctggcc gaaaacaggg 
acgacgcgct gcttcagcgc gtggctcgtt 
accggctggt gggggatgtg cgcgaggccg 



gagagttcga cagaatcaat ttcggtgtcg 8820 
acgtctcctg agttgtcttg ataggcgatc 8880 
tggagatctc cgcgtccggc tcgctccacg 8940 
atgagctgcg agaaggcgtt gaggcctccc 9000 
ccttcggcat cgcgggcgcg catgaccacc 9060 
aagacggcgt agtttcgcag gcgctgaaag 9120 
gccacgaaga agtacataac ccagcgtcgc 9180 
tcaaggcgct ccatggcctc gtagaagtcc 9240 
gccgacacgg ttaactcctc ctccagaaga 9300 
tcgcgctcaa aggctacagg ggcctcttct 9360 
tccccttctt cttcttctgg cggcggtggg 9420 
accgggaggc ggtcgacaaa gcgctcgatc 9480 
gtgacggcgc ggccgttctc gcgggggcgc 9540 
ttatgggttg gcggggggct gccatgcggc 9600 
aattgttgtg taggtactcc gccgccgagg 9660 
gaaaacctct cgagaaaggc gtctaaccag 9720 
gcgggcggca gcgggcggcg gtcggggttg 9780 
ttaaagtagg cggtcttgag acggcggatg 9840 
gcctgctgaa tgcgcaggcg gtcggccatg 9900 
tctttgtagt agtcttgcat gagcctttct 9960 
cctgcatctc ttgcatctat cgctgcggcg 10020 
cttcctccca tgcgtgtgac cccgaagccc 10080 
acaacgcgct cggctaatat ggcctgctgc 1014 0 
atgtccacaa agcggtggta tgcgcccgtg 10200 
gaccagttaa cggtctggtg acccggctgc 10260 
gccctcgagt caaatacgta gtcgttgcaa 10320 
aagtgcggcg gcggctggcg gtagaggggc 10380 
agatcttcca acataaggcg atgatatccg 1044 0 
gcggcggtgg tggaggcgcg cggaaagtcg 10500 
aaaaagtgct ccatggtcgg gacgctctgg 10560 
tagaccgtgc aaaaggagag cctgtaagcg 10620 
cgcaagggta tcatggcgga cgaccggggt 10680 
tccatgcggt taccgcccgc gtgtcgaacc 10740 
gctccttttg gcttccttcc aggcgcggcg 10800 
cgcgcgcagc gtaagcggtt aggctggaaa 10860 
gccggagggt tattttccaa gggttgagtc 10920 
ccggactgcg gcgaacgggg gtttgcctcc 10980 
tccggaaaca gggacgagcc ccttttttgc 11040 
atgcgccccc ctcctcagca gcggcaagag 11100 
tcccctcctc ctaccgcgtc aggaggggcg 11160 
gattacgaac ccccgcggcg ccgggcccgg 11220 
ctggcgcggc taggagcgcc ctctcctgag 11280 
acgcgtgagg cgtacgtgcc gcggcagaac 11340 
gaggagatgc gggatcgaaa gttccacgca 11400 
gagcggttgc tgcgcgagga ggactttgag 114 60 
cgcgcacacg tggcggccgc cgacctggta 11520 
attaactttc aaaaaagctt taacaaccac 11580 
gctataggac tgatgcatct gtgggacttt 11640 
aagccgctca tggcgcagct gttccttata 11700 
agggatgcgc tgctaaacat agtagagccc 117 60 
atcctgcaga gcatagtggt gcaggagcgc 11820 
atcaactatt ccatgcttag cctgggcaag 11880 
tacgttccca tagacaagga ggtaaagatc 11940 
gtgcttacct tgagcgacga cctgggcgtt 12000 
agcgtgagcc ggcggcgcga gctcagcgac 12060 
ctggctggca cgggcagcgg cgatagagag 12120 
ctgcgctggg ccccaagccg acgcgccctg 12180 
gtggcacccg cgcgcgctgg caacgtcggc 12240 
tacgagccag aggacggcga gtactaagcg 12300 
caacggaccc ggcggtgcgg gcggcgctgc 12360 
acgactggcg ccaggtcatg gaccgcatca 12420 
tccggcagca gccgcaggcc aaccggctct 124 80 
gcgcaaaccc cacgcacgag aaggtgctgg 12540 
ccatccggcc cgacgaggcc ggcctggtct 12600 
acaacagcgg caacgtgcag accaacctgg 12660 
tggcgcagcg tgagcgcgcg cagcagcagg 12720 
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gcaacctggg ctccatggtt gcactaaacg 
cgcggggaca ggaggactac accaactttg 
caccgcaaag tgaggtgtac cagtctgggc 
gcctgcagac cgtaaacctg agccaggctt 
gggctcccac aggcgaccgc gcgaccgtgt 
tgctgctgct aatagcgccc ttcacggaca 
gtcacttgct gacactgtac cgcgaggcca 
tccaggagat tacaagtgtc agccgcgcgc 
caaccctaaa ctacctgctg accaaccggc 
acagcgagga ggagcgcatt ttgcgctacg 
gcgacggggt aacgcccagc gtggcgctgg 
tgtatgcctc aaaccggccg tttatcaacc 
ccgtgaaccc cgagtatttc accaatgcca 
gtttctacac cgggggattc gaggtgcccg 
tagacgacag cgtgttttcc ccgcaaccgc 
aggcagaggc ggcgctgcga aaggaaagct 
gcgctgcggc cccgcggtca gatgctagta 
ccagcactcg caccacccgc ccgcgcctgc 
tgctgcagcc gcagcgcgaa aaaaacctgc 
gcctagtgga caagatgagt agatggaaga 
gcccgcgccc gcccacccgt cgtcaaaggc 
acgatgactc ggcagacgac agcagcgtcc 
cgcaccttcg ccccaggctg gggagaatgt 
aaaaactcac caaggccatg gcaccgagcg 
gcgcgcggcg atgtatgagg aaggtcctcc 
gccagtggcg gcggcgctgg gttctccctt 
tccgcggtac ctgcggccta ccggggggag 
cctattcgac accacccgtg tgtacctggt 
gaactaccag aacgaccaca gcaactttct 
cccgggggag gcaagcacac agaccatcaa 
cctgaaaacc atcctgcata ccaacatgcc 
gtttaaggcg cgggtgatgg tgtcgcgctt 
atacgagtgg gtggagttca cgctgcccga 
ccttatgaac aacgcgatcg tggagcacta 
ggaaagcgac atcggggtaa agtttgacac 
cactggtctt gtcatgcctg gggtatatac 
gctgccagga tgcggggtgg acttcaccca 
caagcggcaa cccttccagg agggctttag 
cattcccgca ctgttggatg tggacgccta 
gggcgggggt ggcgcaggcg gcagcaacag 
cgcggcagcc gcggcaatgc agccggtgga 
cacctttgcc acacgggctg aggagaagcg 
cgcccccgct gcgcaacccg aggtcgagaa 
gacagaggac agcaagaaac gcagttacaa 
gtaccgcagc tggtaccttg catacaacta 
gaccctgctt tgcactcctg acgtaacctg 
agacatgatg caagaccccg tgaccttccg 
ggtgggcgcc gagctgttgc ccgtgcactc 
ctcccaactc atccgccagt ttacctctct 
ccagattttg gcgcgcccgc cagcccccac 
tctcacagat cacgggacgc taccgctgcg 
cattactgac gccagacgcc gcacctgccc 
gccgcgcgtc ctatcgagcc gcactttttg 
caataacaca ggctggggcc tgcgcttccc 
ctccgaccaa cacccagtgc gcgtgcgcgg 
acgcggccgc actgggcgca ccaccgtcga 
gcgcaactac acgcccacgc cgccaccagt 
ggtgcgcgga gcccggcgct atgctaaaat 
ccaccgccgc cgacccggca ctgccgccca 
acgtcgcacc ggccgacggg cggccatgcg 
cactgtgccc cccaggtcca ggcgacgagc 
tatgactcag ggtcgcaggg gcaacgtgta 
cgtgcccgtg cgcacccgcc ccccgcgcaa 
gtactgttgt atgtatccag cggcggcggc 
caaagaagag atgctccagg tcatcgcgcc 
gcaggattac aagccccgaa agctaaagcg 



ccttcctgag tacacagccc gccaacgtgc 12780 
tgagcgcact gcggctaatg gtgactgaga 12840 
cagactattt tttccagacc agtagacaag 12900 
tcaaaaactt gcaggggctg tggggggtgc 12960 
ctagcttgct gacgcccaac tcgcgcctgt 13020 
gtggcagcgt grcccgggac acatacctag 13080 
taggtcaggc gcatgtggac gagcatactt 13140 
tggggcagga ggacacgggc agcctggagg 13200 
ggcagaagat cccctcgttg cacagtttaa 13260 
tgcagcagag cgtgagcctt aacctgatgc 13320 
acatgaccgc gcgcaacatg gaaccgggca 13380 
gcctaatgga ctacttgcat cgcgcggccg 13440 
tcttgaaccc gcactggcta ccgccccctg 13500 
agggtaacga tggattcctc tgggacgaca 13560 
agaccctgct agagttgcaa cagcgcgagc 13620 
tccgcaggcc aagcagcttg tccgatctag 13680 
gcccatttcc aagcttgata gggtctctta 13740 
tgggcgagga ggagtaccta aacaactcgc 13800 
ctccggcatt tcccaacaac gggatagaga 13860 
cgtacgcgca ggagcacagg gacgtgccag 13920 
acgaccgtca gcggggtctg gtgtgggagg 13980 
tggatttggg agggagtggc aacccgtttg 14040 
tttaaaaaaa aaaaagcatg atgcaaaata 14100 
ttggttttct tgtattcccc ttagtatgcg 14160 
tccctcctac gagagtgtgg tgagcgcggc 14220 
cgatgctccc ctggacccgc cgtttgtgcc 14280 
aaacagcatc cgttactctg agttggcacc 14340 
ggacaacaag tcaacggatg tggcatccct 14400 
gaccacggtc attcaaaaca atgactacag 14460 
tcttgacgac cggtcgcact ggggcggcga 14520 
aaatgtgaac gagttcatgt ttaccaataa 14580 
gcctactaag gacaatcagg tggagctgaa 14640 
gggcaactac tccgagacca tgaccataga 14700 
cttgaaagtg ggcagacaga acggggttct 14760 
ccgcaacttc agactggggt ttgaccccgt 14820 
aaacgaagcc ttccatccag acatcatttt 14880 
cagccgcctg agcaacttgt tgggcatccg 14940 
gatcacctac gatgatctgg agggtggtaa 15000 
ccaggcgagc ttgaaagatg acaccgaaca 15060 
cagtggcagc ggcgcggaag agaactccaa 15120 
ggacatgaac gatcatgcca ttcgcggcga 15180 
cgctgaggcc gaagcagcgg ccgaagctgc 15240 
gcctcagaag aaaccggtga tcaaacccct 15300 
cctaataagc aatgacagca ccttcaccca 15360 
cggcgaccct cagaccggaa tccgctcatg 15420 
cggctcggag caggtctact ggtcgttgcc 15480 
ctccacgcgc cagatcagca actttccggt 15540 
caagagcttc tacaacgacc aggccgtcta 15600 
gacccacgtg ttcaatcgct ttcccgagaa 15660 
catcaccacc gtcagtgaaa acgttcctgc 15720 
caacagcatc ggaggagtcc agcgagtgac 15780 
ctacgtttac aaggccctgg gcatagtctc 15840 
agcaagcatg tccatcctta tatcgcccag 15900 
aagcaagatg tttggcgggg ccaagaagcg 15960 
gcactaccgc gcgccctggg gcgcgcacaa 16020 
tgacgccatc gacgcggtgg tggaggaggc 16080 
gtccacagtg gacgcggcca ttcagaccgt 16140 
gaagagacgg cggaggcgcg tagcacgtcg 16200 
acgcgcggcg gcggccctgc ttaaccgcgc 16260 
ggccgctcga aggctggccg cgggtattgt 16320 
ggccgccgca gcagccgcgg ccattagtgc 16380 
ttgggtgcgc gactcggtta gcggcctgcg 16440 
ctagattgca agaaaaaact acttagactc 16500 
gcgcaacgaa gctatgtcca agcgcaaaat 16560 
ggagatctat ggccccccga agaaggaaga 16620 
ggtcaaaaag aaaaagaaag atgatgatga 16680 
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tgaacttgac gacgaggtgg aactgctgca cgctaccgcg cccaggcgac gggtacagtg 16740 

gaaaggtcga cgcgtaaaac gtgttttgcg acccggcacc accgtagtct ttacgcccgg 16800 

tgagcgctcc acccgcacct acaagcgcgt gtatgatgag gtgtacggcg acgaggacct 16860 

gcttgagcag gccaacgagc gcctcgggga gtttgcctac ggaaagcggc ataaggacat 16920 

gctggcgttg ccgctggacg agggcaaccc aacacctagc ctaaagcccg taacactgca 16980 

gcaggtgctg cccgcgcttg caccgtccga agaaaagcgc ggcctaaagc gcgagtctgg 17040 

tgacttggca cccaccgtgc agctgatggt acccaagcgc cagcgactgg aagatgtctt 17100 

ggaaaaaatg accgtggaac ctgggctgga gcccgaggtc cgcgtgcggc caatcaagca 17160 

ggtggcgccg ggactgggcg tgcagaccgt ggacgttcag atacccacta ccagtagcac 17220 

cagtattgcc accgccacag agggcatgga gacacaaacg tccccggttg cctcagcggt 17280 

ggcggatgcc gcggtgcagg cggtcgctgc ggccgcgtcc aagacctcta cggaggtgca 17340 

aacggacccg tggatgtttc gcgtttcagc cccccggcgc ccgcgcggtt cgaggaagta 17400 

cggcgccgcc agcgcgctac tgcccgaata tgccctacat ccttccattg cgcctacccc 17460 

cggctatcgt ggctacacct accgccccag aagacgagca actacccgac gccgaaccac 17520 

cactggaacc cgccgccgcc gtcgccgtcg ccagcccgtg ctggccccga tttccgtgcg 17580 

cagggtggct cgcgaaggag gcaggaccct ggtgctgcca acagcgcgct accaccccag 17640 

catcgtttaa aagccggtct ttgtggttct tgcagatatg gccctcacct gccgcctccg 17700 

tttcccggtg ccgggattcc gaggaagaat gcaccgtagg aggggcatgg ccggccacgg 17760 

cctgacgggc ggcatgcgtc gtgcgcacca ccggcggcgg cgcgcgtcgc accgtcgcat 17820 

gcgcggcggt atcctgcccc tccttattcc actgatcgcc gcggcgattg gcgccgtgcc 17880 

cggaattgca tccgtggcct tgcaggcgca gagacactga ttaaaaacaa gttgcatgtg 17940 

gaaaaatcaa aataaaaagt ctggactctc acgctcgctt ggtcctgtaa ctattttgta 18000 

gaatggaaga catcaacttt gcgtctctgg ccccgcgaca cggctcgcgc ccgttcatgg 18060 

gaaactggca agatatcggc accagcaata tgagcggtgg cgccttcagc tggggctcgc 18120 

tgtggagcgg cattaaaaat ttcggttcca ccgttaagaa ctatggcagc aaggcctgga 18180 

acagcagcac aggccagatg ctgagggata agttgaaaga gcaaaatttc caacaaaagg 18240 

tggtagatgg cctggcctct ggcattagcg gggtggtgga cctggccaac caggcagtgc 18300 

aaaataagat taacagtaag cttgatcccc gccctcccgt agaggagcct ccaccggccg 18360 

tggagacagt gtctccagag gggcgtggcg aaaagcgtcc gcgccccgac agggaagaaa 18420 

ctctggtgac gcaaatagac gagcctccct cgtacgagga ggcactaaag caaggcctgc 18480 

ccaccacccg tcccatcgcg cccatggcta ccggagtgct gggccagcac acacccgtaa 18540 

cgctggacct gcctcccccc gccgacaccc agcagaaacc tgtgctgcca ggcccgaccg 18600 

ccgttgttgt aacccgtcct agccgcgcgt ccctgcgccg cgccgccagc ggtccgcgat 18660 

cgttgcggcc cgtagccagt ggcaactggc aaagcacact gaacagcatc gtgggtctgg 18720 

gggtgcaatc cctgaagcgc cgacgatgct tctgaatagc taacgtgtcg tatgtgtgtc 18780 

atgtatgcgt ccatgtcgcc gccagaggag ctgctgagcc gccgcgcgcc cgctttccaa 18840 

gatggctacc ccttcgatga tgccgcagtg gtcttacatg cacatctcgg gccaggacgc 18900 

ctcggagtac ctgagccccg ggctggtgca gtttgcccgc gccaccgaga cgtacttcag 18960 

cctgaataac aagtttagaa accccacggt ggcgcctacg cacgacgtga ccacagaccg 19020 

gtcccagcgt ttgacgctgc ggttcatccc tgtggaccgt gaggatactg cgtactcgta 19080 

caaggcgcgg ttcaccctag ctgtgggtga taaccgtgtg ctggacatgg cttccacgta 19140 

ctttgacatc cgcggcgtgc tggacagggg ccctactttt aagccctact ctggcactgc 19200 

ctacaacgcc ctggctccca agggtgcccc aaatccttgc gaatgggatg aagctgctac 19260 

tgctcttgaa ataaacctag aagaagagga cgatgacaac gaagacgaag tagacgagca 19320 

agctgagcag caaaaaactc acgtatttgg gcaggcgcct tattctggta taaatattac 19380 

aaaggagggt attcaaatag gtgtcgaagg tcaaacacct aaatatgccg ataaaacatt 19440 

tcaacctgaa cctcaaatag gagaatctca gtggtacgaa actgaaatta atcatgcagc 19500 

tgggagagtc cttaaaaaga ctaccccaat gaaaccatgt tacggttcat atgcaaaacc 19560 

cacaaatgaa aatggagggc aaggcattct tgtaaagcaa caaaatggaa agctagaaag 19620 

tcaagtggaa atgcaatttt tctcaactac tgaggcgacc gcaggcaatg gtgataactt 19680 

gactcctaaa gtggtattgt acagtgaaga tgtagatata gaaaccccag acactcatat 19740 

ttcttacatg cccactatta aggaaggtaa ctcacgagaa ctaatgggcc aacaatctat 19800 

gcccaacagg cctaattaca ttgcttttag ggacaatttt attggtctaa tgtattacaa 19860 

cagcacgggt aatatgggtg ttctggcggg ccaagcatcg cagttgaatg ctgttgtaga 19920 

tttgcaagac agaaacacag agctttcata ccagcttttg cttgattcca ttggtgatag 19980 

aaccaggtac ttttctatgt ggaatcaggc tgttgacagc tatgatccag atgttagaat 20040 

tattgaaaat catggaactg aagatgaact tccaaattac tgctttccac tgggaggtgt 20100 

gattaataca gagactctta ccaaggtaaa acctaaaaca ggtcaggaaa atggatggga 20160 

aaaagatgct acagaatttt cagataaaaa tgaaataaga gttggaaata attttgccat 20220 

ggaaatcaat ctaaatgcca acctgtggag aaatttcctg tactccaaca tagcgctgta 20280 

tttgcccgac aagctaaagt acagtccttc caacgtaaaa atttctgata acccaaacac 20340 

ctacgactac atgaacaagc gagtggtggc tcccgggtta gtggactgct acattaacct 20400 

tggagcacgc tggtcccttg actatatgga caacgtcaac ccatttaacc accaccgcaa 204 60 

tgctggcctg cgctaccgct caatgttgct gggcaatggt cgctatgtgc ccttccacat 20520 

ccaggtgcct cagaagttct ttgccattaa aaacctcctt ctcctgccgg gctcatacac 20580 

ctacgagtgg aacttcagga aggatgttaa catggttctg cagagctccc taggaaatga 20640 
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cctaagggtt gacggagcca gcattaagtt 
ccccatggcc cacaacaccg cctccacgct 
ccagtccttt aacgactatc tctccgccgc 
taccaacgtg cccatatcca tcccctcccg 
cacgcgcctt aagactaagg aaaccccatc 
ctactctggc tctataccct acctagatgg 
ggtggccatt acctttgact cttctgtcag 
caacgagttt gaaattaagc gctcagttga 
catgaccaaa gactggttcc tggtacaaat 
cttctatatc ccagagagct acaaggaccg 
catgagccgt caggtggtgg atgatactaa 
acaccaacac aacaactctg gatttgttgg 
ggcctaccct gctaacttcc cctatccgct 
ccagaaaaag tttctttgcg atcgcaccct 
gtccatgggc gcactcacag acctgggcca 
gctagacatg acttttgagg tggatcccat 
tgaagtcttt gacgtggtcc gtgtgcaccg 
cctgcgcacg cccttctcgg ccggcaacgc 
acagctgccg ccatgggctc cagtgagcag 
tgtgggccat attttttggg cacctatgac 
aagctcgcct gcgccatagt caatacggcc 
gcctttgcct ggaacccgca ctcaaaaaca 
gaccagcgac tcaagcaggt ttaccagttt 
attgcttctt cccccgaccg ctgtataacg 
cccaactcgg ccgcctgtgg actattctgc 
ccccaaactc ccatggatca caaccccacc 
atgctcaaca gtccccaggt acagcccacc 
ttcctggagc gccactcgcc ctacttccgc 
tctttttgtc acttgaaaaa catgtaaaaa 
aaatgctttt atttgtacac tctcgggtga 
gtttaaaaat caaaggggtt ctgccgcgca 
cgatactggt gtttagtgct ccacttaaac 
aagttttcac tccacaggct gcgcaccatc 
atcttgaagt cgcagttggg gcctccgccc 
cagcactgga acactatcag cgccgggtgg 
atcagatccg cgtccaggtc ctccgcgttg 
tgccttccca aaaagggcgc gtgcccaggc 
aaaaggtgac cgtgcccggt ctgggcgtta 
tgcttaaaag ccacctgagc ctttgcgcct 
gaaaactgat tggccggaca ggccgcgtcg 
atctgcacca catttcggcc ccaccggttc 
ttcagcgcgc gctgcccgtt ttcgctcgtc 
atcataatgc ttccgtgtag acacttaagc 
cacaacgcgc agcccgtggg ctcgtgatgc 
tacgcctgca ggaatcgccc catcatcgtc 
tgcaacccgc ggtgctcctc gttcagccag 
tggtcaggca gtagtttgaa gttcgccttt 
agcgcgcgcg cagcctccat gcccttctcc 
ttcatcaccg taatttcact ttccgcttcg 
ataccacgcg ccactgggtc gtcttcattc 
ccatgcttga ttagcaccgg tgggttgctg 
ctttcttcct cgctgtccac gattacctct 
gggcgcttct ttttcttctt gggcgcaatg 
gggctgggtg tgcgcggcac cagcgcgtct 
atacgccgcc tcatccgctt ttttgggggc 
gacgacacgt cctccatggt tgggggacgt 
gtttcgcgct gctcctcttc ccgactggcc 
atggagtcag tcgagaagaa ggacagccta 
tccaccgatg ccgccaacgc gcctaccacc 
gaggaagtga ttatcgagca ggacccaggt 
gtaccaacag aggataaaaa gcaagaccag 
gggcgggggg acgaaaggca tggcgactac 
catctgcagc gccagtgcgc cattatctgc 
ctcgccatag cggatgtcag ccttgcctac 
cccaaacgcc aagaaaacgg cacatgcgag 
tttgccgtgc cagaggtgct tgccacctat 



tgatagcatt tgcctttacg ccaccttctt 20700 
tgaggccatg cttagaaacg acaccaacga 207 60 
caacatgctc taccctatac ccgccaacgc 20820 
caactgggcg gctttccgcg gctgggcctt 20880 
actgggctcg ggctacgacc cttattacac 20940 
aaccttttac ctcaaccaca cctttaagaa 21000 
ctggcctggc aatgaccgcc tgcttacccc 21060 
cggggagggt tacaacgttg cccagtgtaa 21120 
gctagctaac tacaacattg gctaccaggg 21180 
catgtactcc ttctttagaa acttccagcc 21240 
atacaaggac taccaacagg tgggcatcct 21300 
ctaccttgcc cccaccatgc gcgaaggaca 21360 
tataggcaag accgcagttg acagcattac 21420 
ttggcgcatc ccattctcca gtaactttat 21480 
aaaccttctc tacgccaact ccgcccacgc 21540 
ggacgagccc acccttcttt atgttttgtt 21600 
gccgcaccgc ggcgtcatcg aaaccgtgta 21660 
cacaacataa agaagcaagc aacatcaaca 21720 
gaactgaaag ccattgtcaa agatcttggt 21780 
aagcgctttc caggctttgt ttctccacac 21840 
ggtcgcgaga ctgggggcgt acactggatg 21900 
tgctacctct ttgagccctt tggcttttct 21960 
gagtacgagt cactcctgcg ccgtagcgcc 22020 
ctggaaaagt ccacccaaag cgtacagggg 22080 
tgcatgtttc tccacgcctt tgccaactgg 22140 
atgaacctta ttaccggggt acccaactcc 22200 
ctgcgtcgca accaggaaca gctctacagc 22260 
agccacagtg cgcagattag gagcgccact 22320 
taatgtacta gagacacttt caataaaggc 22380 
ttatttaccc ccacccttgc cgtctgcgcc 22440 
tcgctatgcg ccactggcag ggacacgttg 22500 
tcaggcacaa ccatccgcgg cagctcggtg 22560 
accaacgcgt ttagcaggtc gggcgccgat 22620 
tgcgcgcgcg agttgcgata cacagggttg 22680 
tgcacgctgg ccagcacgct cttgtcggag 22740 
ctcagggcga acggagtcaa ctttggtagc 22800 
tttgagttgc actcgcaccg tagtggcatc 22860 
ggatacagcg cctgcataaa agccttgatc 22920 
tcagagaaga acatgccgca agacttgccg 22980 
tgcacgcagc accttgcgtc ggtgttggag 2304 0 
ttcacgatct tggccttgct agactgctcc 23100 
acatccattt caatcacgtg ctccttattt 23160 
tcgccttcga tctcagcgca gcggtgcagc 23220 
ttgtaggtca cctctgcaaa cgactgcagg 23280 
acaaaggtct tgttgctggt gaaggtcagc 23340 
gtcttgcata cggccgccag agcttccact 234 00 
agatcgttat ccacgtggta cttgtccatc 234 60 
cacgcagaca cgatcggcac actcagcggg 23520 
ctgggctctt cctcttcctc ttgcgtccgc 23580 
agccgccgca ctgtgcgctt acctcctttg 23640 
aaacccacca tttgtagcgc cacatcttct 23700 
ggtgatggcg ggcgctcggg cttgggagaa 237 60 
gccaaatccg ccgccgaggt cgatggccgc 23820 
tgtgatgagt cttcctcgtc ctcggactcg 23880 
gcccggggag gcggcggcga cggggacggg 23940 
cgcgccgcac cgcgtccgcg ctcgggggtg 24000 
atttccttct cctataggca gaaaaagatc 24060 
accgccccct ctgagttcgc caccaccgcc 24120 
ttccccgtcg aggcaccccc gcttgaggag 24180 
tttgtaagcg aagacgacga ggaccgctca 24240 
gacaacgcag aggcaaacga ggaacaagtc 24 300 
ctagatgtgg gagacgacgt gctgttgaag 24 360 
gacgcgttgc aagagcgcag cgatgtgccc 24 420 
gaacgccacc tattctcacc gcgcgtaccc 24480 
cccaacccgc gcctcaactt ctaccccgta 24540 
cacatctttt tccaaaactg caagataccc 24 600 
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ctatcctgcc gtgccaaccg cagccgagcg 
gtcatacctg atatcgcctc gctcaacgaa 
gacgagaagc gcgcggcaaa cgctctgcaa 
ggagtgttgg tggaactcga gggtgacaac 
gaggtcaccc actttgccta cccggcactt 
atgagtgagc tgatcgtgcg ccgtgcgcag 
caaacagagg agggcctacc cgcagttggc 
cgcgagcctg ccgacttgga ggagcgacgc 
gtggagcttg agtgcatgca gcggttcttt 
gaaacattgc actacacctt tcgacagggc 
gtggagctct gcaacctggt ctcctacctt 
aacgtgcttc attccacgct caagggcgag 
tacttatttc tatgctacac ctggcagacg 
gagtgcaacc tcaaggagct gcagaaactg 
gccttcaacg agcgctccgt ggccgcgcac 
cttaaaaccc tgcaacaggg tctgccagac 
aggaacttta tcctagagcg ctcaggaatc 
gactttgtgc ccattaagta ccgcgaatgc 
ctgcagctag ccaactacct tgcctaccac 
ggtctactgg agtgtcactg tcgctgcaac 
aattcgcagc tgcttaacga aagtcaaatt 
cctgacgaaa agtccgcggc tccggggttg 
taccttcgca aatttgtacc tgaggactac 
caatcccgcc cgccaaatgc ggagcttacc 
ggccaattgc aagccatcaa caaagcccgc 
gtttacttgg acccccagtc cggcgaggag 
tatcagcagc agccgcgggc ccttgcttcc 
gccgccgcca cccacggacg aggaggaata 
cgaggaggag gaggacatga tggaagactg 
cgaagaggtg tcagacgaaa caccgtcacc 
gaaatcggca accggttcca gcatggctac 
gcccgttcgc cgacccaacc gtagatggga 
gcagccgccg ccgttagccc aagagcaaca 
gcacaagaac gccatagttg cttgcttgca 
ccgctttctt ctctaccatc acggcgtggc 
tcatctctac agcccatact gcaccggcgg 
cacagaagca aaggcgaccg gatagcaaga 
cggcagcagc aggaggagga gcgctgcgtc 
agcttagaaa caggattttt cccactctgt 
aacaagagct gaaaataaaa aacaggtctc 
acaaaagcga agatcagctt cggcgcacgc 
actgcgcgct gactcttaag gactagtttc 
tacgtcatct ccagcggcca cacccggcgc 
aggaaattcc cacgccctac atgtggagtt 
ctgcccaaga ctactcaacc cgaataaact 
gggtcaacgg aatccgcgcc caccgaaacc 
ccacacctcg taataacctt aatccccgta 
gtcccgctcc caccactgtg gtacttccca 
actcaggggc gcagcttgcg ggcggctttc 
taactcacct gacaatcaga gggcgaggta 
cgcttggtct ccgtccggac gggacatttc 
cgcctcgtca ggcaatccta actctgcaga 
ttggaactct gcaatttatt gaggagtttg 
gacctcccgg ccactatccg gatcaattta 
cggacggcta cgactgataa ttaagtggag 
tccactgtcg ccgccacaag tgctttgccc 
tgcccgagga tcatatcgag gatctttgtt 
gaaattaaaa tatactgggg ctcctatcgc 
ccaagcaaac caaggcgaac cttacctggt 
aacagtttca acccagacgg agtgagtcta 
atcagaaaaa acaccaccct ccttacctgc 
ttcctggatg tcagcatctg actttggcca 
actacagcga cccaccctaa cagagatgac 
acttacatct accacaaata caccccaagt 
gggcatgtgg tggttctcca tagcgcttat 
ctgctgccta aagcgcaaac gcgcccgacc 



gacaagcagc tggccttgcg gcagggcgct 24 660 
gtgccaaaaa tctttgaggg tcttggacgc 24720 
caggaaaaca gcgaaaatga aagtcactct 24780 
gcgcgcctag ccgtactaaa acgcagcatc 24840 
aacctacccc ccaaggtcat gagcacagtc 24 900 
cccctggaga gggatgcaaa tttgcaagaa 24 960 
gacgagcagc tagcgcgctg gcttcaaacg 25020 
aaactaatga tggccgcagt gctcgttacc 25080 
gctgacccgg agatgcagcg caagctagag 25140 
tacgtacgcc aggcctgcaa gatctccaac -25200 
ggaattttgc acgaaaaccg ccttgggcaa 25260 
gcgcgccgcg actacgtccg cgactgcgtt 25320 
gccatgggcg tttggcagca gtgcttggag 25380 
ctaaagcaaa acttgaagga cctatggacg 254 40 
ctggcggaca tcattttccc cgaacgcctg 25500 
ttcaccagtc aaagcatgtt gcagaacttt 25560 
ttgcccgcca cctgctgtgc acttcctagc 25620 
cctccgccgc tttggggcca ctgctacctt 25680 
tctgacataa tggaagacgt gagcggtgac 25740 
ctatgcaccc cgcaccgctc cctggtttgc 25800 
atcggtacct ttgagctgca gggtccctcg 25860 
aaactcactc cggggctgtg gacgtcggct 25920 
cacgcccacg agattaggtt ctacgaagac 25980 
gcctgcgtca ttacccaggg ccacattctt 2 604 0 
caagagtttc tgctacgaaa gggacggggg 26100 
ctcaacccaa tccccccgcc gccgcagccc 26160 
caggatggca cccaaaaaga agctgcagct 26220 
ctgggacagt caggcagagg aggttttgga 26280 
ggagagccta gacgaggaag cttccgaggt 26340 
ctcggtcgca ttcccctcgc cggcgcccca 26400 
aacctccgct cctcaggcgc cgccggcact 26460 
caccactgga accagggccg gtaagtccaa 26520 
acagcgccaa ggctaccgct catggcgcgg 26580 
agactgtggg ggcaacatct ccttcgcccg 26640 
cttcccccgt aacatcctgc attactaccg 26700 
cagcggcagc ggcagcaaca gcagcggcca 26760 
ctctgacaaa gcccaagaaa tccacagcgg 26820 
tggcgcccaa cgaacccgta tcgacccgcg 26880 
atgctatatt tcaacagagc aggggccaag 2 694 0 
tgcgatccct cacccgcagc tgcctgtatc 27000 
tggaagacgc ggaggctctc ttcagtaaat 27060 
gcgccctttc tcaaatttaa gcgcgaaaac 27120 
cagcacctgt cgtcagcgcc attatgagca 27180 
accagccaca aatgggactt gcggctggag 27240 
acatgagcgc gggaccccac atgatatccc 27300 
gaattctctt ggaacaggcg gctattacca 27360 
gttggcccgc tgccctggtg taccaggaaa 27420 
gagacgccca ggccgaagtt cagatgacta 27480 
gtcacagggt gcggtcgccc gggcagggta 27540 
ttcagctcaa cgacgagtcg gtgagctcct 27600 
agatcggcgg cgccggccgt ccttcattca 27 660 
cctcgtcctc tgagccgcgc tctggaggca 27720 
tgccatcggt ctactttaac cccttctcgg 27780 
ttcctaactt tgacgcggta aaggactcgg 27840 
aggcagagca actgcgcctg aaacacctgg 27900 
gcgactccgg tgagttttgc tactttgaat 27960 
gccatctctg tgctgagtat aataaataca 28020 
catcctgtaa acgccaccgt cttcacccgc 28080 
acttttaaca tctctccctc tgtgatttac 28140 
cgagagaacc tctccgagct cagctactcc 28200 
cgggaacgta cccttaatta aaagtcaggc 28260 
gcacctgtcc cgcggatttg ttccagtcca 28320 
caacacaacc aacgcggccg ccgctaccgg 28380 
ttctgccttt gtcaataact gggataactt 28440 
gtttgtatgc cttattatta tgtggctcat 28500 
acccatctat agtcccatca ttgtgctaca 28560 
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cccaaacaat gatggaatcc atagattgga 
agtatgatta aatgagatta attaaggaat 
tgccctcctc ccagctctgg tattgcagct 
taaatggaat gtcagtttcc tcctgttcct 
tgcagatgaa gcgcgcaaga ccgtctgaag 
cggaaaccgg tcctccaact gtgccttttc 
ttcaagagag tccccctggg gtactctctt 
atggcatgct tgcgctcaaa atgggcaacg 
cctcccaaaa tgtaaccact gtgagcccac 
tggaaatatc tgcacccctc acagttacct 
ctctaatggt cgcgggcaac acactcacca 
actccaaact tagcattgcc acccaaggac 
tgcaaacatc aggccccctc accaccaccg 
cccctctaac tactgccact ggtagcttgg 
aaaatggaaa actaggacta aagtacgggg 
ctttgaccgt agcaactggt ccaggtgtga 
ttactggagc cttgggtttt gattcacaag 
taaggattga ttctcaaaac agacgcctta 
aaaaccaact aaatctaaga ctaggacagg 
tggatattaa ctacaacaaa ggcctttact 
ttgaggttaa cctaagcact gccaaggggt 
atgcaggaga tgggcttgaa tttggttcac 
caaaaattgg ccatggccta gaatttgatt 
ctggccttag ttttgacagc acaggtgcca 
taactttgtg gaccacacca gctccatctc 
ctaaactcac tttggtctta acaaaatgtg 
tggctgttaa aggcagtttg gctccaatat 
taagatttga cgaaaatgga gtgctactaa 
actttagaaa tggagatctt actgaaggca 
ctaacctatc agcttatcca aaatctcacg 
aagtttactt aaacggagac aaaactaaac 
cacaggaaac aggagacaca actccaagtg 
ctggccacaa ctacattaat gaaatatttg 
cccaagaata aagaatcgtt tgtgttatgt 
atttcaagtc atttttcatt cagtagtata 
ccgtacctta atcaaactca cagaacccta 
acagagtaca cagtcctttc tccccggctg 
gacatattct taggtgttat attccacacg 
atattaataa actccccggg cagctcactt 
acaggctgct gtccaacttg cggttgctta 
atgggggtag agtcataatc gtgcatcagg 
ataaactgct gccgccgccg ctccgtcctg 
gcgatgattc gcaccgcccg cagcataagg 
ctgatctcac ttaaatcagc acagtaactg 
ccacagtgca aggcgctgta tccaaagctc 
tcataccaca agcgcaggta gattaagtgg 
attacctctt ttggcatgtt gtaattcacc 
aacatggcgc catccaccac catcctaaac 
cactgcaggg aaccgggact ggaacaatga 
atcatcatgc tcgtcatgat atcaatgttg 
ctcaggatta caagctcctc ccgcgttaga 
atcagcgtaa atcccacact gcagggaaga 
aaagtgttac attcgggcag cagcggatga 
tcaaaaggag gtagacgatc cctactgtac 
ggtcgtagtg tcatgccaaa tggaacgccg 
ggtgcgggcg tgacaaacag atctgcgtct 
gtagttgtag tatatccact ctctcaaagc 
gtaaactcct tcatgcgccg ctgccctgat 
cagccaacct acacattcgt tctgcgagtc 
aaccatgttt ttttttttat tccaaaagat 
agtgaacgcg ctcccctccg gtggcgtggt 
catttgtaag atgttgcaca atggcttcca 
cgtaaaggct aaacccttca gggtgaatct 
tgcccaaata attctcatct cgccaccttc 
taagtccggc cattgtaaaa atctgctcca 
gaatcatgat tgcaaaaatt caggttcctc 



cggactgaaa cacatgttct tttctcttac 28620 
ttctgtccag tttattcagc agcacctcct 28680 
tcctcctggc tgcaaacttt ctccacaatc 28740 
gtccatccgc acccactatc ttcatgttgt 28800 
ataccttcaa ccccgtgtat ccatatgaca 28860 
ttactcctcc ctttgtatcc cccaatgggt 28920 
tgcgcctatc cgaacctcta gttacctcca 28980 
gcctctctct ggacgaggcc ggcaacctta 29040 
ctctcaaaaa aaccaagtca aacataaacc 29100 
cagaagccct aactgtggct gccgccgcac 29160 
tgcaatcaca ggccccgcta accgtgcacg 29220 
ccctcacagt gtcagaagga aagctagccc 29280 
atagcagtac ccttactatc actgcctcac 29340 
gcattgactt gaaagagccc atttatacac 29400 
ctcctttgca tgtaacagac gacctaaaca 294 60 
ctattaataa tacttccttg caaactaaag 29520 
gcaatatgca acttaatgta gcaggaggac 29580 
tacttgatgt tagttatccg tttgatgctc 29640 
gccctctttt tataaactca gcccacaact 29700 
tgtttacagc ttcaaacaat tccaaaaagc 29760 
tgatgtttga cgctacagcc atagccatta 29820 
ctaatgcacc aaacacaaat cccctcaaaa 29880 
caaacaaggc tatggttcct aaactaggaa 29940 
ttacagtagg aaacaaaaat aatgataagc 30000 
ctaactgtag actaaatgca gagaaagatg 30060 
gcagtcaaat acttgctaca gtttcagttt 30120 
ctggaacagt tcaaagtgct catcttatta 30180 
acaattcctt cctggaccca gaatattgga 30240 
cagcctatac aaacgctgtt ggatttatgc 30300 
gtaaaactgc caaaagtaac attgtcagtc 30360 
ctgtaacact aaccattaca ctaaacggta 30420 
catactctat gtcattttca tgggactggt 30480 
ccacatcctc ttacactttt tcatacattg 30540 
ttcaacgtgt ttatttttca attgcagaaa 30600 
gccccaccac cacatagctt atacagatca 30660 
gtattcaacc tgccacctcc ctcccaacac 30720 
gccttaaaaa gcatcatatc atgggtaaca 30780 
gtttcctgtc gagccaaacg ctcatcagtg 30840 
aagttcatgt cgctgtccag ctgctgagcc 30900 
acgggcggcg aaggagaagt ccacgcctac 30960 
atagggcggt ggtgctgcag cagcgcgcga 31020 
caggaataca acatggcagt ggtctcctca 31080 
cgccttgtcc tccgggcaca gcagcgcacc 31140 
cagcacagca ccacaatatt gttcaaaatc 31200 
atggcgggga ccacagaacc cacgtggcca 31260 
cgacccctca taaacacgct ggacataaac 31320 
acctcccggt accatataaa cctctgatta 31380 
cagctggcca aaacctgccc gccggctata 314 40 
cagtggagag cccaggactc gtaaccatgg 31500 
gcacaacaca ggcacacgtg catacacttc 31560 
accatatccc agggaacaac ccattcctga 31620 
cctcgcacgt aactcacgtt gtgcattgtc 31680 
tcctccagta tggtagcgcg ggtttctgtc 31740 
ggagtgcgcc gagacaaccg agatcgtgtt 31800 
gacgtagtca tatttcctga agcaaaacca 31860 
ccggtctcgc cgcttagatc gctctgtgta 31920 
atccaggcgc cccctggctt cgggttctat 31980 
aacatccacc accgcagaat aagccacacc 32040 
acacacggga ggagcgggaa gagctggaag 32100 
tatccaaaac ctcaaaatga agatctatta 32160 
caaactctac agccaaagaa cagataatgg 32220 
aaaggcaaac ggccctcacg tccaagtgga 32280 
cctctataaa cattccagca ccttcaacca 32340 
tcaatatatc tctaagcaaa tcccgaatat 32400 
gagcgccctc caccttcagc ctcaagcagc 324 60 
acagacctgt ataagattca aaagcggaac 32520 
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attaacaaaa ataccgcgat cccgtaggtc 

caggtctgca cggaccagcg cggccacttc 

actgattatg acacgcatac tcggagctat 

gttgcatggg cggcgatata aaatgcaagg 

gcaaaaaaga aagcacatcg tagtcatgct 

ccaccacaga aaaagacacc atttttctct 

caaaataaaa taacaaaaaa acatttaaac 

aacccttata agcataagac ggactacggc 

accgtgatta aaaagcacca ccgacagctc 

ctcggtaaac acatcaggtt gattcatcgg 

ggggaataca tacccgcagg cgtagagaca 

aattaatagg agagaaaaac acataaacac 

caccctcccg ctccagaaca acatacagcg 

accagtaaaa aagaaaacct attaaaaaaa 

tcacagtgta aaaaagggcc aagtgcagag 

acggttaaag tccacaaaaa acacccagaa 

aagccaaaaa acccacaact tcctcaaatc 

tcccatttta agaaaactac aattcccaac 

gtcacccgcc ccgttcccac gccccgcgcc 

attggcttca atccaaaata aggtatatta 

<210> 4 
<211> 34448 
<212> DNA 

<213> Adenovirus subgroup C 
<400> 4 

catcatcaat aatatacctt attttggatt 
ttgtgacgtg gcgcggggcg tgggaacggg 
gatgttgcaa gtgtggcgga acacatgtaa 
gtgtgcgccg gtgtacacag gaagtgacaa 
taaatttggg cgtaaccgag taagatttgg 
agtgaaatct gaataatttt gtgttactca 
gactttgacc gtttacgtgg agactcgccc 
cgggtcaaag ttggcgtttt attattatag 
tgagttcctc aagaggccac tcttgagtgc 
tccgacaccg ggactgaaaa tgagacatat 
aatggccgcc agtcttttgg accagctgat 
tcctagccat tttgaaccac ctacccttca 
cgaagatccc aacgaggagg cggtttcgca 
gcaggaaggg attgacttac tcacttttcc 
cctttcccgg cagcccgagc agccggagca 
ccttgtaccg gaggtgatcg atcttacctg 
cgaggatgaa gagggtgagg agtttgtgtt 
caggtcttgt cattatcacc ggaggaatac 
ctatatgagg acctgtggca tgtttgtcta 
tagagtggtg ggtttggtgt ggtaattttt 
gaattttgta ttgtgatttt tttaaaaggt 
ccagaaccgg agcctgcaag acctacccgc 
cgcccgacat cacctgtgtc tagagaatgc 
ccttctaaca cacctcctga gatacacccg 
gccgtgagag ttggtgggcg tcgccaggct 
cctgggcaac ctttggactt gagctgtaaa 
ttgcgtgtgt ggttaacgcc tttgtttgct 
gagataatgt ttaacttgca tggcgtgtta 
cgccgtgggc taatcttggt tacatctgac 
ttttctgctg tgcgtaactt gctggaacag 
tttctgtggg gctcatccca ggcaaagtta 
gaatttgaag agcttttgaa atcctgtggt 
caggcgcttt tccaagagaa ggtcatcaag 
gcggctgctg ttgctttttt gagttttata 
agcggggggt acctgctgga ttttctggcc 
aagaatcgcc tgctactgtt gtcttccgtc 
cagcagcagc aggaggaagc caggcggcgg 
gccggcctgg accctcggga atgaatgttg 
gacgcatttt gacaattaca gaggatgggc 



ccttcgcagg gccagctgaa cataatcgtg 32580 

cccgccagga accttgacaa aagaacccac 32640 

gctaaccagc gtagccccga tgtaagcttt 32700 

tgctgctcaa aaaatcaggc aaagcctcgc 327 60 

catgcagata aaggcaggta agctccggaa 32820 

caaacatgtc tgcgggtttc tgcataaaca 32880 

attagaagcc tgtcttacaa caggaaaaac 32940 

catgccggcg tgaccgtaaa aaaactggtc 33000 

ctcggtcatg tccggagtca taatgtaaga 33060 

tcagtgctaa aaagcgaccg aaatagcccg 33120 

acattacagc ccccatagga ggtataacaa 33180 

ctgaaaaacc ctcctgccta ggcaaaatag 33240 

cttcacagcg gcagcctaac agtcagcctt 33300 

caccactcga cacggcacca gctcaatcag 333 60 

cgagtatata taggactaaa aaatgacgta 33420 

aaccgcacgc gaacctacgc ccagaaacga 33480 

gtcacttccg ttttcccacg ttacgtaact 33540 

acatacaagt tactccgccc taaaacctac 33600 

acgtcacaaa ctccaccccc tcattatcat 33660 

ttgatgatg 33699 



gaagccaata tgataatgag ggggtggagt 60 
gcgggtgacg tagtagtgtg gcggaagtgt 120 
gcgacggatg tggcaaaagt gacgtttttg 180 
ttttcgcgcg gttttaggcg gatgttgtag 240 
ccattttcgc gggaaaactg aataagagga 300 
tagcgcgtaa tatttgtcta gggccgcggg 360 
aggtgttttt ctcaggtgtt ttccgcgttc 420 
tcagctgacg tgtagtgtat ttatacccgg 480 
cagcgagtag agttttctcc tccgagccgc 540 
tatctgccac ggaggtgtta ttaccgaaga 600 
cgaagaggta ctggctgata atcttccacc 660 
cgaactgtat gatttagacg tgacggcccc 720 
gatttttccc gactctgtaa tgttggcggt 780 
gccggcgccc ggttctccgg agccgcctca 840 
gagagccttg ggtccggttt ctatgccaaa 900 
ccacgaggct ggctttccac ccagtgacga 960 
agattatgtg gagcaccccg ggcacggttg 1020 
gggggaccca gatattatgt gttcgctttg 1080 
cagtaagtga aaattatggg cagtgggtga 1140 
tttttaattt ttacagtttt gtggtttaaa 1200 
cctgtgtctg aacctgagcc tgagcccgag 1260 
cgtcctaaaa tggcgcctgc tatcctgaga 1320 
aatagtagta cggatagctg tgactccggt 1380 
gtggtcccgc tgtgccccat taaaccagtt 1440 
gtggaatgta tcgaggactt gcttaacgag 1500 
cgccccaggc cataaggtgt aaacctgtga 1560 
gaatgagttg atgtaagttt aataaagggt 1620 
aatggggcgg ggcttaaagg gtatataatg 1680 
ctcatggagg cttgggagtg tttggaagat 1740 
agctctaaca gtacctcttg gttttggagg 1800 
gtctgcagaa ttaaggagga ttacaagtgg 1860 
gagctgtttg attctttgaa tctgggtcac 1920 
actttggatt tttccacacc ggggcgcgct 1980 
aaggataaat ggagcgaaga aacccatctg 2040 
atgcatctgt ggagagcggt tgtgagacac 2100 
cgcccggcga taataccgac ggaggagcag 2160 
cggcaggagc agagcccatg gaacccgaga 2220 
tacaggtggc tgaactgtat ccagaactga 2280 
aggggctaaa gggggtaaag agggagcggg 2340 
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gggcttgtga ggctacagag gaggctagga 
gtcctgagtg tattactttt caacagatca 
tggcgcagaa gtattccata gagcagctga 
ttgaggaggc tattagggta tatgcaaagg 
tcagcaaact tgtaaatatc aggaattgtt 
agatagatac ggaggatagg gtggccttta 
tgcttggcat ggacggggtg gttattatga 
gtacggtttt cctggccaat accaacctta 
acaatacctg tgtggaagcc tggaccgatg 
gctggaaggg ggtggtgtgt cgccccaaaa 
aaaggtgtac cttgggtatc ctgtctgagg 
ccgactgtgg ttgcttcatg ctagtgaaaa 
gtggcaactg cgaggacagg gcctctcaga 
tgctgaagac cattcacgta gccagccact 
acatactgac ccgctgttcc ttgcatttgg 
aatgcaattt gagtcacact aagatattgc 
tgaacggggt gtttgacatg accatgaaga 
gcaccaggtg cagaccctgc gagtgtggcg 
tggatgtgac cgaggagctg aggcccgatc 
ttggctctag cgatgaagat acagattgag 
tgggaaagaa tatataaggt gggggtctta 
ccgccgccat gagcaccaac tcgtttgatg 
gcatgccccc atgggccggg gtgcgtcaga 
ccgtcctgcc cgcaaactct actaccttga 
agactgcagc ctccgccgcc gcttcagccg 
actttgcttt cctgagcccg cttgcaagca 
acaagttgac ggctcttttg gcacaattgg 
ctcagcagct gttggatctg cgccagcagg 
atgcggttta aaacataaat aaaaaaccag 
cttgctgtct ttatttaggg gttttgcgcg 
cgttgagggt cctgtgtatt ttttccagga 
acatgggcat aagcccgtct ctggggtgga 
gggtggtgtt gtagatgatc cagtcgtagc 
ctttcagtag caagctgatt gccaggggca 
taagctggga tgggtgcata cgtggggata 
tggctatgtt cccagccata tccctccggg 
tgtatccggt gcacttggga aatttgtcat 
tggagacgcc cttgtgacct ccaagatttt 
gcccacgggc ggcggcctgg gcgaagatat 
ccaggatgag atcgtcatag gccattttta 
gtataatggt tccatccggc ccaggggcgt 
ctttgagttc agatgggggg atcatgtcta 
gggtagggga gatcagctgg gaagaaagca 
cggtgggccc gtaaatcaca cctattaccg 
tgccgtcatc cctgagcagg ggggccactt 
ccctgaccaa atccgccaga aggcgctcgc 
caaagttttt caacggtttg agaccgtccg 
gcagttccag gcggtcccac agctcggtca 
ctcctcgttt cgcgggttgg ggcggctttc 
acgggccagg gtcatgtctt tccacgggcg 
ggtgaagggg tgcgctccgg gctgcgcgct 
ggtgctgaag cgctgccggt cttcgccctg 
gtcatagtcc agcccctccg cggcgtggcc 
gccgcacgag gggcagtgca gacttttgag 
ttccggggag taggcatccg cgccgcaggc 
ggtgagctct ggccgttcgg ggtcaaaaac 
cttacctctg gtttccatga gccggtgtcc 
cccgtataca gacttgagag gcctgtcctc 
aaactcggac cactctgaga caaaggctcg 
ggaggggtag cggtcgttgt ccactagggg 
gtcgccctct tcggcatcaa ggaaggtgat 
tgttcctgaa ggggggctat aaaagggggt 
atcgctgtct gcgagggcca gctgttgggg 
ttctgcgcta agattgtcag tttccaaaaa 
ggtgatgcct ttgagggtgg ccgcatccat 
aagcttggtg gcaaacgacc cgtagagggc 



atctagcttt tagctcaatg accagacacc 2400 
aggataattg cgctaatgag cttgatctgc 24 60 
ccacttactg gctgcagcca ggggatgatt 2520 
tggcacttag gccagattgc aagtacaaga 2580 
gctacatttc tgggaacggg gccgaggtgg 2640 
gatgtagcat gataaatatg tggccggggg 2700 
atgtaaggtt tactggcccc aattttagcg 2760 
tcctacacgg tgtaagcttc tatgggttta 2820 
taagggttcg gggctgtgcc ttttactgct 2880 
gcagggcttc aattaagaaa tgcctctttg 2940 
gtaactccag ggtgcgccac aatgtggcct 3000 
gcgtggctgt gattaagcat aacatggtat 3060 
tgctgacctg ctcggacggc aactgtcacc 3120 
ctcgcaaggc ctggccagtg tttgagcata 3180 
gtaacaggag gggggtgttc ctaccttacc 3240 
ttgagcccga gagcatgtcc aaggtgaacc 3300 
tctggaaggt gctgaggtac gatgagaccc 3360 
gtaaacatat taggaaccag cctgtgatgc 3420 
acttggtgct ggcctgcacc cgcgctgagt 3480 
gtactgaaat gtgtgggcgt ggcttaaggg 354 0 
tgtagttttg tatctgtttt gcagcagccg 3600 
gaagcattgt gagctcatat ttgacaacgc 3660 
atgtgatggg ctccagcatt gatggtcgcc 3720 
cctacgagac cgtgtctgga acgccgttgg 3780 
ctgcagccac cgcccgcggg attgtgactg 3840 
gtgcagcttc ccgttcatcc gcccgcgatg 3900 
attctttgac ccgggaactt aatgtcgttt 3960 
tttctgccct gaaggcttcc tcccctccca 4020 
actctgtttg gatttggatc aagcaagtgt 4080 
cgcggtaggc ccgggaccag cggtctcggt 4140 
cgtggtaaag gtgactctgg atgttcagat 4200 
ggtagcacca ctgcagagct tcatgctgcg 4260 
aggagcgctg ggcgtggtgc ctaaaaatgt 4320 
ggcccttggt gtaagtgttt acaaagcggt 4380 
tgagatgcat cttggactgt atttttaggt 4440 
gattcatgtt gtgcagaacc accagcacag 4500 
gtagcttaga aggaaatgcg tggaagaact 4560 
ccatgcattc gtccataatg atggcaatgg 4 620 
ttctgggatc actaacgtca tagttgtgtt 4680 
caaagcgcgg gcggagggtg ccagactgcg 4740 
agttaccctc acagatttgc atttcccacg 4800 
cctgcggggc gatgaagaaa acggtttccg 4860 
ggttcctgag cagctgcgac ttaccgcagc 4920 
ggtgcaactg gtagttaaga gagctgcagc 4 980 
cgttaagcat gtccctgact cgcatgtttt 5040 
cgcccagcga tagcagttct tgcaaggaag 5100 
ccgtaggcat gcttttgagc gtttgaccaa 5160 
cctgctctac ggcatctcga tccagcatat 5220 
gctgtacggc agtagtcggt gctcgtccag 5280 
cagggtcctc gtcagcgtag tctgggtcac 5340 
ggccagggtg cgcttgaggc tggtcctgct 5400 
cgcgtcggcc aggtagcatt tgaccatggt 5460 
cttggcgcgc agcttgccct tggaggaggc 5520 
ggcgtagagc ttgggcgcga gaaataccga 5580 
cccgcagacg gtctcgcatt ccacgagcca 5640 
caggtttccc ccatgctttt tgatgcgttt 5700 
acgctcggtg acgaaaaggc tgtccgtgtc 57 60 
gagcggtgtt ccgcggtcct cctcgtatag 5820 
cgtccaggcc agcacgaagg aggctaagtg 5880 
gtccactcgc tccagggtgt gaagacacat 5940 
tggtttgtag gtgtaggcca cgtgaccggg 6000 
gggggcgcgt tcgtcctcac tctcttccgc 6060 
tgagtactcc ctctgaaaag cgggcatgac 6120 
cgaggaggat ttgatattca cctggcccgc 6180 
ctggtcagaa aagacaatct ttttgttgtc 6240 
gttggacagc aacttggcga tggagcgcag 6300 
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ggtttggttt ttgtcgcgat cggcgcgctc 
gcgcgcaacg caccgccatt cgggaaagac 
gcgccaaccg cggttgtgca gggtgacaag 
gcgctcgttg gtccagcaga ggcggccgcc 
tagctgcgtc tcgtccgggg ggtctgcgtc 
gtcgaagtag tctatcttgc atccttgcaa 
aagcgcgcgc tcgtatgggt tgagtggggg 
ggcgtacatg ccgcaaatgt cgtaaacgta 
agggtagcat cttccaccgc ggatgctggc 
agcgaggagg tcgggaccga ggttgctacg 
cctgaagatg gcatgtgagt tggatgatat 
gtctgtgaga cctaccgcgt cacgcacgaa 
cagctcggcg gtgacctgca cgtctagggc 
atacttatcc tgtccctttt ttttccacag 
tttccagtac tcttggatcg gaaacccgtc 
gaactggttg acggcctggt aggcgcagca 
cgcggccttc cggagcgagg tgtgggtgag 
gtactggtat ttgaagtcag tgtcgtcgca 
gcgctttttg gaacgcggat ttggcagggc 
cgcgcgaggc ataaagttgc gtgtgatgcg 
aattacctgg gcggcgagca cgatctcgtc 
aagttccaag aagcgcggga tgcccttgat 
gagctcttca ggggagctga gcccgtgctc 
ggaagcgacg aatgagctcc acaggtcacg 
ggtcctaaac tggcgaccta tggccatttt 
gtcttgttcc cagcggtccc atccaaggtt 
aggctcatct ccgccgaact tcatgaccag 
ccccatccaa gtataggtct ctacatcgta 
cgagccgatc gggaagaact ggatctcccg 
gtgaaagtag aagtccctgc gacgggccga 
gcagtactgg cagcggtgca cgggctgtac 
cacaaggaag cagagtggga atttgagccc 
tacttcggct gcttgtcctt gaccgtctgg 
caccacgccg cgcgagccca aagtccagat 
aacatcgcgc agatgggagc tgtccatggt 
gagctcctgc aggtttacct cgcatagacg 
cctaatttcc aggggctggt tggtggcggc 
cggcgcgact acggtaccgc gcggcgggcg 
atctaaaagc ggtgacgcgg gcgagccccc 
agagggggca ggggcacgtc ggcgccgcgc 
ttgctggcga acgcgacgac gcggcggttg 
acgacgggcc cggtgagctt gagcctgaaa 
ttgacggcgg cctggcgcaa aatctcctgc 
tcggccatga actgctcgat ctcttcctcc 
gtggcggcga ggtcgttgga aatgcgggcc 
tcgttccaga cgcggctgta gaccacgccc 
tgcgcgagat tgagctccac gtgccgggcg 
aggtagttga gggtggtggc ggtgtgttct 
aacgtggatt cgttgatatc ccccaaggcc 
acggcgaagt tgaaaaactg ggagttgcgc 
cggatgagct cggcgacagt gtcgcgcacc 
tcttcttcaa tctcctcttc cataagggcc 
ggagggggga cacggcggcg acgacggcgc 
atctccccgc ggcgacggcg catggtctcg 
agttggaaga cgccgcccgt catgtcccgg 
agggatacgg cgctaacgat gcatctcaac 
gacctgagcg agtccgcatc gaccggatcg 
tcacagtcgc aaggtaggct gagcaccgtg 
tttctggcgg aggtgctgct gatgatgtaa 
gtcgacagaa gcaccatgtc cttgggtccg 
ccccaggctt cgttttgaca tcggcgcagg 
accggcactt cttcttctcc ttcctcttgt 
gcggcggagt ttggccgtag gtggcgccct 
ctcatcggct gaagcagggc taggtcggcg 
acctgcgtga gggtagactg gaagtcatcc 
ttgatggtgt aagtgcagtt ggccataacg 



cttggccgcg atgtttagct gcacgtattc 6360 
ggtggtgcgc tcgtcgggca ccaggtgcac 6420 
ctcaacgctg gtggctacct ctccgcgtag 6480 
cttgcgcgag cagaatggcg gtagggggtc 6540 
cacggtaaag accccgggca gcaggcgcgc 6600 
gtctagcgcc tgctgccatg cgcgggcggc 6660 
accccatggc atggggtggg tgagcgcgga 6720 
gaggggctct ctgagtattc caagatatgt 6780 
gcgcacgtaa tcgtatagtt cgtgcgaggg 6840 
ggcgggctgc tctgctcgga agactatctg 6900 
ggttggacgc tggaagacgt tgaagctggc 6960 
ggaggcgtag gagtcgcgca gcttgttgac 7020 
gcagtagtcc agggtttcct tgatgatgtc 7080 
ctcgcggttg aggacaaact cttcgcggtc 7140 
ggcctccgaa cggtaagagc ctagcatgta 7200 
tcccttttct acgggtagcg cgtatgcctg 7260 
cgcaaaggtg tccctgacca tgactttgag 7320 
tccgccctgc tcccagagca aaaagtccgt 7380 
gaaggtgaca tcgttgaaga gtatctttcc 74 40 
gaagggtccc ggcacctcgg aacggttgtt 7500 
aaagccgttg atgttgtggc ccacaatgta 7560 
ggaaggcaat tttttaagtt cctcgtaggt 7620 
tgaaagggcc cagtctgcaa gatgagggtt 7680 
ggccattagc atttgcaggt ggtcgcgaaa 7740 
ttctggggtg atgcagtaga aggtaagcgg 7800 
cgcggctagg tctcgcgcgg cagtcactag 78 60 
catgaagggc acgagctgct tcccaaaggc 7920 
ggtgacaaag agacgctcgg tgcgaggatg 7980 
ccaccaattg gaggagtggc tattgatgtg 8040 
acactcgtgc tggcttttgt aaaaacgtgc 8100 
atcctgcacg aggttgacct gacgaccgcg 8160 
ctcgcctggc gggtttggct ggtggtcttc 8220 
ctgctcgagg ggagttacgg tggatcggac 8280 
gtccgcgcgc ggcggtcgga gcttgatgac 8340 
ctggagctcc cgcggcgtca ggtcaggcgg 84 00 
ggtcagggcg cgggctagat ccaggtgata 84 60 
gtcgatggct tgcaagaggc cgcatccccg 8520 
gtgggccgcg ggggtgtcct tggatgatgc 8580 
ggaggtaggg ggggctccgg acccgccggg 864 0 
gcgggcagga gctggtgctg cgcgcgtagg 8700 
atctcctgaa tctggcgcct ctgcgtgaag 87 60 
gagagttcga cagaatcaat ttcggtgtcg 8820 
acgtctcctg agttgtcttg ataggcgatc 8880 
tggagatctc cgcgtccggc tcgctccacg 8940 
atgagctgcg agaaggcgtt gaggcctccc 9000 
ccttcggcat cgcgggcgcg catgaccacc 9060 
aagacggcgt agtttcgcag gcgctgaaag 9120 
gccacgaaga agtacataac ccagcgtcgc 9180 
tcaaggcgct ccatggcctc gtagaagtcc 9240 
gccgacacgg ttaactcctc ctccagaaga 9300 
tcgcgctcaa aggctacagg ggcctcttct 9360 
tccccttctt cttcttctgg cggcggtggg 9420 
accgggaggc ggtcgacaaa gcgctcgatc 9480 
gtgacggcgc ggccgttctc gcgggggcgc 9540 
ttatgggttg gcggggggct gccatgcggc 9600 
aattgttgtg taggtactcc gccgccgagg 9660 
gaaaacctct cgagaaaggc gtctaaccag 9720 
gcgggcggca gcgggcggcg gtcggggttg 9780 
ttaaagtagg cggtcttgag acggcggatg 9840 
gcctgctgaa tgcgcaggcg gtcggccatg 9900 
tctttgtagt agtcttgcat gagcctttct 9960 
cctgcatctc ttgcatctat cgctgcggcg 10020 
cttcctccca tgcgtgtgac cccgaagccc 10080 
acaacgcgct cggctaatat ggcctgctgc 10140 
atgtccacaa agcggtggta tgcgcccgtg 10200 
gaccagttaa cggtctggtg acccggctgc 10260 
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gagagctcgg tgtacctgag acgcgagtaa gccctcgagt caaatacgta gtcgttgcaa 10320 
gtccgcacca ggtactggta tcccaccaaa aagtgcggcg gcggctggcg gtagaggggc 10380 
cagcgtaggg tggccggggc tccgggggcg agatcttcca acataaggcg atgatatccg 10440 
tagatgtacc tggacatcca ggtgatgccg gcggcggtgg tggaggcgcg cggaaagtcg 10500 
cggacgcggt tccagatgtt gcgcagcggc aaaaagtgct ccatggtcgg gacgctctgg 10560 
ccggtcaggc gcgcgcaatc gttgacgctc tagaccgtgc aaaaggagag cctgtaagcg 10620 
ggcactcttc cgtggtctgg tggataaatt cgcaagggta tcatggcgga cgaccggggt 10680 
tcgagccccg tatccggccg tccgccgtga tccatgcggt taccgcccgc gtgtcgaacc 10740 
caggtgtgcg acgtcagaca acgggggagt gctccttttg gcttccttcc aggcgcggcg 10800 
gctgctgcgc tagctttttt ggccactggc cgcgcgcagc gtaagcggtt aggctggaaa 10860 
gcgaaagcat taagtggctc gctccctgta gccggagggt tattttccaa gggttgagtc 10920 
gcgggacccc cggttcgagt ctcggaccgg ccggactgcg gcgaacgggg gtttgcctcc 10980 
ccgtcatgca agaccccgct tgcaaattcc tccggaaaca gggacgagcc ccttttttgc 11040 
ttttcccaga tgcatccggt gctgcggcag atgcgccccc ctcctcagca gcggcaagag 11100 
caagagcagc ggcagacatg cagggcaccc tcccctcctc ctaccgcgtc aggaggggcg 11160 
acatccgcgg ttgacgcggc agcagatggt gattacgaac ccccgcggcg ccgggcccgg 11220 
cactacctgg acttggagga gggcgagggc ctggcgcggc taggagcgcc ctctcctgag 11280 
cggtacccaa gggtgcagct gaagcgtgat acgcgtgagg cgtacgtgcc gcggcagaac 11340 
ctgtttcgcg accgcgaggg agaggagccc gaggagatgc gggatcgaaa gttccacgca 11400 
gggcgcgagc tgcggcatgg cctgaatcgc gagcggttgc tgcgcgagga ggactttgag 11460 
cccgacgcgc gaaccgggat tagtcccgcg cgcgcacacg tggcggccgc cgacctggta 11520 
accgcatacg agcagacggt gaaccaggag attaactttc aaaaaagctt taacaaccac 11580 
gtgcgtacgc ttgtggcgcg cgaggaggtg gctataggac tgatgcatct gtgggacttt 11640 
gtaagcgcgc tggagcaaaa cccaaatagc aagccgctca tggcgcagct gttccttata 11700 
gtgcagcaca gcagggacaa cgaggcattc agggatgcgc tgctaaacat agtagagccc 117 60 
gagggccgct ggctgctcga tttgataaac atcctgcaga gcatagtggt gcaggagcgc 11820 
agcttgagcc tggctgacaa ggtggccgcc atcaactatt ccatgcttag cctgggcaag 11880 
ttttacgccc gcaagatata ccatacccct tacgttccca tagacaagga ggtaaagatc 11940 
gaggggttct acatgcgcat ggcgctgaag gtgcttacct tgagcgacga cctgggcgtt 12000 
tatcgcaacg agcgcatcca caaggccgtg agcgtgagcc ggcggcgcga gctcagcgac 12060 
cgcgagctga tgcacagcct gcaaagggcc ctggctggca cgggcagcgg cgatagagag 12120 
gccgagtcct actttgacgc gggcgctgac ctgcgctggg ccccaagccg acgcgccctg 12180 
gaggcagctg gggccggacc tgggctggcg gtggcacccg cgcgcgctgg caacgtcggc 12240 
ggcgtggagg aatatgacga ggacgatgag tacgagccag aggacggcga gtactaagcg 12300 
gtgatgtttc tgatcagatg atgcaagacg caacggaccc ggcggtgcgg gcggcgctgc 12360 
agagccagcc gtccggcctt aactccacgg acgactggcg ccaggtcatg gaccgcatca 12420 
tgtcgctgac tgcgcgcaat cctgacgcgt tccggcagca gccgcaggcc aaccggctct 12480 
ccgcaattct ggaagcggtg gtcccggcgc gcgcaaaccc cacgcacgag aaggtgctgg 12540 
cgatcgtaaa cgcgctggcc gaaaacaggg ccatccggcc cgacgaggcc ggcctggtct 12600 
acgacgcgct gcttcagcgc gtggctcgtt acaacagcgg caacgtgcag accaacctgg 12660 
accggctggt gggggatgtg cgcgaggccg tggcgcagcg tgagcgcgcg cagcagcagg 12720 
gcaacctggg ctccatggtt gcactaaacg ccttcctgag tacacagccc gccaacgtgc 12780 
cgcggggaca ggaggactac accaactttg tgagcgcact gcggctaatg gtgactgaga 12840 
caccgcaaag tgaggtgtac cagtctgggc cagactattt tttccagacc agtagacaag 12900 
gcctgcagac cgtaaacctg agccaggctt tcaaaaactt gcaggggctg tggggggtgc 12960 
gggctcccac aggcgaccgc gcgaccgtgt ctagcttgct gacgcccaac tcgcgcctgt 13020 
tgctgctgct aatagcgccc ttcacggaca gtggcagcgt gtcccgggac acatacctag 13080 
gtcacttgct gacactgtac cgcgaggcca taggtcaggc gcatgtggac gagcatactt 13140 
tccaggagat tacaagtgtc agccgcgcgc tggggcagga ggacacgggc agcctggagg 13200 
caaccctaaa ctacctgctg accaaccggc ggcagaagat cccctcgttg cacagtttaa 13260 
acagcgagga ggagcgcatt ttgcgctacg tgcagcagag cgtgagcctt aacctgatgc 13320 
gcgacggggt aacgcccagc gtggcgctgg acatgaccgc gcgcaacatg gaaccgggca 13380 
tgtatgcctc aaaccggccg tttatcaacc gcctaatgga ctacttgcat cgcgcggccg 13440 
ccgtgaaccc cgagtatttc accaatgcca tcrtgaaccc gcactggcta ccgccccctg 13500 
gtttctacac cgggggattc gaggtgcccg agggtaacga tggattcctc tgggacgaca 13560 
tagacgacag cgtgttttcc ccgcaaccgc agaccctgct agagttgcaa cagcgcgagc 13620 
aggcagaggc ggcgctgcga aaggaaagct tccgcaggcc aagcagcttg tccgatctag 13680 
gcgctgcggc cccgcggtca gatgctagta gcccatttcc aagcttgata gggtctctta 13740 
ccagcactcg caccacccgc ccgcgcctgc tgggcgagga ggagtaccta aacaactcgc 13800 
tgctgcagcc gcagcgcgaa aaaaacctgc ctccggcatt tcccaacaac gggatagaga 13860 
gcctagtgga caagatgagt agatggaaga cgtacgcgca ggagcacagg gacgtgccag 13920 
gcccgcgccc gcccacccgt cgtcaaaggc acgaccgtca gcggggtctg gtgtgggagg 13980 
acgatgactc ggcagacgac agcagcgtcc tggatttggg agggagtggc aacccgtttg 14040 
cgcaccttcg ccccaggctg gggagaatgt tttaaaaaaa aaaaagcatg atgcaaaata 14100 
aaaaactcac caaggccatg gcaccgagcg ttggttttct tgtattcccc ttagtatgcg 14160 
gcgcgcggcg atgtatgagg aaggtcctcc tccctcctac gagagtgtgg tgagcgcggc 14220 
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gccagtggcg gcggcgctgg gttctccctt cgatgctccc ctggacccgc cgtttgtgcc 14280 
tccgcggtac ctgcggccta ccggggggag aaacagcatc cgttactctg agttggcacc 14340 
cctattcgac accacccgtg tgtacctggt ggacaacaag tcaacggatg tggcatccct 14400 
gaactaccag aacgaccaca gcaactttct gaccacggtc attcaaaaca atgactacag 14460 
cccgggggag gcaagcacac agaccatcaa tcttgacgac cggtcgcact ggggcggcga 14520 
cctgaaaacc atcctgcata ccaacatgcc aaatgtgaac gagttcatgt ttaccaataa 14580 
gtttaaggcg cgggtgatgg tgtcgcgctt gcctactaag gacaatcagg tggagctgaa 14640 
atacgagtgg gtggagttca cgctgcccga gggcaactac tccgagacca tgaccataga 14700 
ccttatgaac aacgcgatcg tggagcacta cttgaaagtg ggcagacaga acggggttct 14760 
ggaaagcgac atcggggtaa agtttgacac ccgcaacttc agactggggt ttgaccccgt 14820 
cactggtctt gtcatgcctg gggtatatac aaacgaagcc ttccatccag acatcatttt 14880 
gctgccagga tgcggggtgg acttcaccca cagccgcctg agcaacttgt tgggcatccg 14 940 
caagcggcaa cccttccagg agggctttag gatcacctac gatgatctgg agggtggtaa 15000 
cattcccgca ctgttggatg tggacgccta ccaggcgagc ttgaaagatg acaccgaaca 15060 
gggcgggggt ggcgcaggcg gcagcaacag cagtggcagc ggcgcggaag agaactccaa 15120 
cgcggcagcc gcggcaatgc agccggtgga ggacatgaac gatcatgcca ttcgcggcga 15180 
cacctttgcc acacgggctg aggagaagcg cgctgaggcc gaagcagcgg ccgaagctgc 1524 0 
cgcccccgct gcgcaacccg aggtcgagaa gcctcagaag aaaccggtga tcaaacccct 15300 
gacagaggac agcaagaaac gcagttacaa cctaataagc aatgacagca ccttcaccca 15360 
gtaccgcagc tggtaccttg catacaacta cggcgaccct cagaccggaa tccgctcatg 15420 
gaccctgctt tgcactcctg acgtaacctg cggctcggag caggtctact ggtcgttgcc 15480 
agacatgatg caagaccccg tgaccttccg ctccacgcgc cagatcagca actttccggt 15540 
ggtgggcgcc gagctgttgc ccgtgcactc caagagcttc tacaacgacc aggccgtcta 15600 
ctcccaactc atccgccagt ttacctctct gacccacgtg ttcaatcgct ttcccgagaa 15660 
ccagattttg gcgcgcccgc cagcccccac catcaccacc gtcagtgaaa acgttcctgc 15720 
tctcacagat cacgggacgc taccgctgcg caacagcatc ggaggagtcc agcgagtgac 15780 
cattactgac gccagacgcc gcacctgccc ctacgtttac aaggccctgg gcatagtctc 15840 
gccgcgcgtc ctatcgagcc gcactttttg agcaagcatg tccatcctta tatcgcccag 15900 
caataacaca ggctggggcc tgcgcttccc aagcaagatg tttggcgggg ccaagaagcg 15960 
ctccgaccaa cacccagtgc gcgtgcgcgg gcactaccgc gcgccctggg gcgcgcacaa 16020 
acgcggccgc actgggcgca ccaccgtcga tgacgccatc gacgcggtgg tggaggaggc 16080 
gcgcaactac acgcccacgc cgccaccagt gtccacagtg gacgcggcca ttcagaccgt 16140 
ggtgcgcgga gcccggcgct atgctaaaat gaagagacgg cggaggcgcg tagcacgtcg 16200 
ccaccgccgc cgacccggca ctgccgccca acgcgcggcg gcggccctgc ttaaccgcgc 16260 
acgtcgcacc ggccgacggg cggccatgcg ggccgctcga aggctggccg cgggtattgt 16320 
cactgtgccc cccaggtcca ggcgacgagc ggccgccgca gcagccgcgg ccattagtgc 16380 
tatgactcag ggtcgcaggg gcaacgtgta ttgggtgcgc gactcggtta gcggcctgcg 16440 
cgtgcccgtg cgcacccgcc ccccgcgcaa ctagattgca agaaaaaact acttagactc 16500 
gtactgttgt atgtatccag cggcggcggc gcgcaacgaa gctatgtcca agcgcaaaat 16560 
caaagaagag atgctccagg tcatcgcgcc ggagatctat ggccccccga agaaggaaga 16620 
gcaggattac aagccccgaa agctaaagcg ggtcaaaaag aaaaagaaag atgatgatga 16680 
tgaacttgac gacgaggtgg aactgctgca cgctaccgcg cccaggcgac gggtacagtg 16740 
gaaaggtcga cgcgtaaaac gtgttttgcg acccggcacc accgtagtct ttacgcccgg 16800 
tgagcgctcc acccgcacct acaagcgcgt gtatgatgag gtgtacggcg acgaggacct 16860 
gcttgagcag gccaacgagc gcctcgggga gtttgcctac ggaaagcggc ataaggacat 16920 
gctggcgttg ccgctggacg agggcaaccc aacacctagc ctaaagcccg taacactgca 16980 
gcaggtgctg cccgcgcttg caccgtccga agaaaagcgc ggcctaaagc gcgagtctgg 17040 
tgacttggca cccaccgtgc agctgatggt acccaagcgc cagcgactgg aagatgtctt 17100 
ggaaaaaatg accgtggaac ctgggctgga gcccgaggtc cgcgtgcggc caatcaagca 17160 
ggtggcgccg ggactgggcg tgcagaccgt ggacgttcag atacccacta ccagtagcac 17220 
cagtattgcc accgccacag agggcatgga gacacaaacg tccccggttg cctcagcggt 17280 
ggcggatgcc gcggtgcagg cggtcgctgc ggccgcgtcc aagacctcta cggaggtgca 17340 
aacggacccg tggatgtttc gcgtttcagc cccccggcgc ccgcgcggtt cgaggaagta 174 00 
cggcgccgcc agcgcgctac tgcccgaata tgccctacat ccttccattg cgcctacccc 17460 
cggctatcgt ggctacacct accgccccag aagacgagca actacccgac gccgaaccac 17520 
cactggaacc cgccgccgcc gtcgccgtcg ccagcccgtg ctggccccga tttccgtgcg 17580 
cagggtggct cgcgaaggag gcaggaccct ggtgctgcca acagcgcgct accaccccag 17640 
catcgtttaa aagccggtct ttgtggttct tgcagatatg gccctcacct gccgcctccg 17700 
tttcccggtg ccgggattcc gaggaagaat gcaccgtagg aggggcatgg ccggccacgg 17760 
cctgacgggc ggcatgcgtc gtgcgcacca ccggcggcgg cgcgcgtcgc accgtcgcat 17820 
gcgcggcggt atcctgcccc tccttattcc actgatcgcc gcggcgattg gcgccgtgcc 17880 
cggaattgca tccgtggcct tgcaggcgca gagacactga ttaaaaacaa gttgcatgtg 17940 
gaaaaatcaa aataaaaagt ctggactctc acgctcgctt ggtcctgtaa ctattttgta 18000 
gaatggaaga catcaacttt gcgtctctgg ccccgcgaca cggctcgcgc ccgttcatgg 18060 
gaaactggca agatatcggc accagcaata tgagcggtgg cgccttcagc tggggctcgc 18120 
tgtggagcgg cattaaaaat ttcggttcca ccgttaagaa ctatggcagc aaggcctgga 18180 
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acagcagcac aggccagatg ctgagggata agttgaaaga gcaaaatttc caacaaaagg 18240 
tggtagatgg cctggcctct ggcattagcg gggtggtgga cctggccaac caggcagtgc 18300 
aaaataagat taacagtaag cttgatcccc gccctcccgt agaggagcct ccaccggccg 18360 
tggagacagt gtctccagag gggcgtggcg aaaagcgtcc gcgccccgac agggaagaaa 18420 
ctctggtgac gcaaatagac gagcctccct cgtacgagga ggcactaaag caaggcctgc 18480 
ccaccacccg tcccatcgcg cccatggcta ccggagtgct gggccagcac acacccgtaa 18540 
cgctggacct gcctcccccc gccgacaccc agcagaaacc tgtgctgcca ggcccgaccg 18600 
ccgttgttgt aacccgtcct agccgcgcgt ccctgcgccg cgccgccagc ggtccgcgat 18 660 
cgttgcggcc cgtagccagt ggcaactggc aaagcacact gaacagcatc gtgggtctgg 18720 
gggtgcaatc cctgaagcgc cgacgatgct tctgaatagc taacgtgtcg tatgtgtgtc 18780 
atgtatgcgt ccatgtcgcc gccagaggag ctgctgagcc gccgcgcgcc cgctttccaa 18840 
gatggctacc ccttcgatga tgccgcagtg gtcttacatg cacatctcgg gccaggacgc 18900 
ctcggagtac ctgagccccg ggctggtgca gtttgcccgc gccaccgaga cgtacttcag 18960 
cctgaataac aagtttagaa accccacggt ggcgcctacg cacgacgtga ccacagaccg 19020 
gtcccagcgt ttgacgctgc ggttcatccc tgtggaccgt gaggatactg cgtactcgta 19080 
caaggcgcgg ttcaccctag ctgtgggtga taaccgtgtg ctggacatgg cttccacgta 19140 
ctttgacatc cgcggcgtgc tggacagggg ccctactttt aagccctact ctggcactgc 19200 
ctacaacgcc ctggctccca agggtgcccc aaatccttgc gaatgggatg aagctgctac 19260 
tgctcttgaa ataaacctag aagaagagga cgatgacaac gaagacgaag tagacgagca 19320 
agctgagcag caaaaaactc acgtatttgg gcaggcgcct tattctggta taaatattac 19380 
aaaggagggt attcaaatag gtgtcgaagg tcaaacacct aaatatgccg ataaaacatt 1944 0 
tcaacctgaa cctcaaatag gagaatctca gtggtacgaa actgaaatta atcatgcagc 19500 
tgggagagtc cttaaaaaga ctaccccaat gaaaccatgt tacggttcat atgcaaaacc 19560 
cacaaatgaa aatggagggc aaggcattct tgtaaagcaa caaaatggaa agctagaaag 19620 
tcaagtggaa atgcaatttt tctcaactac tgaggcgacc gcaggcaatg gtgataactt 19680 
gactcctaaa gtggtattgt acagtgaaga tgtagatata gaaaccccag acactcatat 19740 
ttcttacatg cccactatta aggaaggtaa ctcacgagaa ctaatgggcc aacaatctat 19800 
gcccaacagg cctaattaca ttgcttttag ggacaatttt attggtctaa tgtattacaa 19860 
cagcacgggt aatatgggtg ttctggcggg ccaagcatcg cagttgaatg ctgttgtaga 19920 
tttgcaagac agaaacacag agctttcata ccagcttttg cttgattcca ttggtgatag 19980 
aaccaggtac ttttctatgt ggaatcaggc tgttgacagc tatgatccag atgttagaat 20040 
tattgaaaat catggaactg aagatgaact tccaaattac tgctttccac tgggaggtgt 20100 
gattaataca gagactctta ccaaggtaaa acctaaaaca ggtcaggaaa atggatggga 20160 
aaaagatgct acagaatttt cagataaaaa tgaaataaga gttggaaata attttgccat 20220 
ggaaatcaat ctaaatgcca acctgtggag aaatttcctg tactccaaca tagcgctgta 20280 
tttgcccgac aagctaaagt acagtccttc caacgtaaaa atttctgata acccaaacac 20340 
ctacgactac atgaacaagc gagtggtggc tcccgggtta gtggactgct acattaacct 20400 
tggagcacgc tggtcccttg actatatgga caacgtcaac ccatttaacc accaccgcaa 20460 
tgctggcctg cgctaccgct caatgttgct gggcaatggt cgctatgtgc ccttccacat 20520 
ccaggtgcct cagaagttct ttgccattaa aaacctcctt ctcctgccgg gctcatacac 20580 
ctacgagtgg aacttcagga aggatgttaa catggttctg cagagctccc taggaaatga 20640 
cctaagggtt gacggagcca gcattaagtt tgatagcatt tgcctttacg ccaccttctt 20700 
ccccatggcc cacaacaccg cctccacgct tgaggccatg cttagaaacg acaccaacga 20760 
ccagtccttt aacgactatc tctccgccgc caacatgctc taccctatac ccgccaacgc 20820 
taccaacgtg cccatatcca tcccctcccg caactgggcg gctttccgcg gctgggcctt 20880 
cacgcgcctt aagactaagg aaaccccatc actgggctcg ggctacgacc cttattacac 20940 
ctactctggc tctataccct acctagatgg aaccttttac ctcaaccaca cctttaagaa 21000 
ggtggccatt acctttgact cttctgtcag ctggcctggc aatgaccgcc tgcttacccc 21060 
caacgagttt gaaattaagc gctcagttga cggggagggt tacaacgttg cccagtgtaa 21120 
catgaccaaa gactggttcc tggtacaaat gctagctaac tacaacattg gctaccaggg 21180 
cttctatatc ccagagagct acaaggaccg catgtactcc ttctttagaa acttccagcc 21240 
catgagccgt caggtggtgg atgatactaa atacaaggac taccaacagg tgggcatcct 21300 
acaccaacac aacaactctg gatttgttgg ctaccttgcc cccaccatgc gcgaaggaca 21360 
ggcctaccct gctaacttcc cctatccgct tataggcaag accgcagttg acagcattac 21420 
ccagaaaaag tttctttgcg atcgcaccct ttggcgcatc ccattctcca gtaactttat 21480 
gtccatgggc gcactcacag acctgggcca aaaccttctc tacgccaact ccgcccacgc 21540 
gctagacatg acttttgagg tggatcccat ggacgagccc acccttcttt atgttttgtt 21600 
tgaagtcttt gacgtggtcc gtgtgcaccg gccgcaccgc ggcgtcatcg aaaccgtgta 21660 
cctgcgcacg cccttctcgg ccggcaacgc cacaacataa agaagcaagc aacatcaaca 21720 
acagctgccg ccatgggctc cagtgagcag gaactgaaag ccattgtcaa agatcttggt 21780 
tgtgggccat attttttggg cacctatgac aagcgctttc caggctttgt ttctccacac 21840 
aagctcgcct gcgccatagt caatacggcc ggtcgcgaga ctgggggcgt acactggatg 21900 
gcctttgcct ggaacccgca ctcaaaaaca tgctacctct ttgagccctt tggcttttct 21960 
gaccagcgac tcaagcaggt ttaccagttt gagtacgagt cactcctgcg ccgtagcgcc 22020 
attgcttctt cccccgaccg ctgtataacg ctggaaaagt ccacccaaag cgtacagggg 22080 
cccaactcgg ccgcctgtgg actattctgc tgcatgtttc tccacgcctt tgccaactgg 22140 
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ccccaaactc ccatggatca caaccccacc atgaacctta ttaccggggt acccaactcc 22200 
atgctcaaca gtccccaggt acagcccacc ctgcgtcgca accaggaaca gctctacagc 22260 
ttcctggagc gccactcgcc ctacttccgc agccacagtg cgcagattag gagcgccact 22320 
tctttttgtc acttgaaaaa catgtaaaaa taatgtacta gagacacttt caataaaggc 22380 
aaatgctttt atttgtacac tctcgggtga ttatttaccc ccacccttgc cgtctgcgcc 22440 
gtttaaaaat caaaggggtt ctgccgcgca tcgctatgcg ccactggcag ggacacgttg 22500 
cgatactggt gtttagtgct ccacttaaac tcaggcacaa ccatccgcgg cagctcggtg 22560 
aagttttcac tccacaggct gcgcaccatc accaacgcgt ttagcaggtc gggcgccgat 22620 
atcttgaagt cgcagttggg gcctccgccc tgcgcgcgcg agttgcgata cacagggttg 22680 
cagcactgga acactatcag cgccgggtgg tgcacgctgg ccagcacgct cttgtcggag 22740 
atcagatccg cgtccaggtc ctccgcgttg ctcagggcga acggagtcaa ctttggtagc 22800 
tgccttccca aaaagggcgc gtgcccaggc tttgagttgc actcgcaccg tagtggcatc 22860 
aaaaggtgac cgtgcccggt ctgggcgtta ggatacagcg cctgcataaa agccttgatc 22920 
tgcttaaaag ccacctgagc ctttgcgcct tcagagaaga acatgccgca agacttgccg 22980 
gaaaactgat tggccggaca ggccgcgtcg tgcacgcagc accttgcgtc ggtgttggag 23040 
atctgcacca catttcggcc ccaccggttc ttcacgatct tggccttgct agactgctcc 23100 
ttcagcgcgc gctgcccgtt ttcgctcgtc acatccattt caatcacgtg ctccttattt 23160 
atcataatgc ttccgtgtag acacttaagc tcgccttcga tctcagcgca gcggtgcagc 23220 
cacaacgcgc agcccgtggg ctcgtgatgc ttgtaggtca cctctgcaaa cgactgcagg 23280 
tacgcctgca ggaatcgccc catcatcgtc acaaaggtct tgttgctggt gaaggtcagc 23340 
tgcaacccgc ggtgctcctc gttcagccag gtcttgcata cggccgccag agcttccact 234 00 
tggtcaggca gtagtttgaa gttcgccttt agatcgttat ccacgtggta cttgtccatc 23460 
agcgcgcgcg cagcctccat gcccttctcc cacgcagaca cgatcggcac actcagcggg 23520 
ttcatcaccg taatttcact ttccgcttcg ctgggctctt cctcttcctc ttgcgtccgc 23580 
ataccacgcg ccactgggtc gtcttcattc agccgccgca ctgtgcgctt acctcctttg 23640 
ccatgcttga ttagcaccgg tgggttgctg aaacccacca tttgtagcgc cacatcttct 23700 
ctttcttcct cgctgtccac gattacctct ggtgatggcg ggcgctcggg cttgggagaa 23760 
gggcgcttct ttttcttctt gggcgcaatg gccaaatccg ccgccgaggt cgatggccgc 23820 
gggctgggtg tgcgcggcac cagcgcgtct tgtgatgagt cttcctcgtc ctcggactcg 23880 
atacgccgcc tcatccgctt ttttgggggc gcccggggag gcggcggcga cggggacggg 23940 
gacgacacgt cctccatggt tgggggacgt cgcgccgcac cgcgtccgcg ctcgggggtg 24000 
gtttcgcgct gctcctcttc ccgactggcc atttccttct cctataggca gaaaaagatc 24060 
atggagtcag tcgagaagaa ggacagccta accgccccct ctgagttcgc caccaccgcc 24120 
tccaccgatg ccgccaacgc gcctaccacc ttccccgtcg aggcaccccc gcttgaggag 24180 
gaggaagtga ttatcgagca ggacccaggt tttgtaagcg aagacgacga ggaccgctca 24240 
gtaccaacag aggataaaaa gcaagaccag gacaacgcag aggcaaacga ggaacaagtc 24300 
gggcgggggg acgaaaggca tggcgactac ctagatgtgg gagacgacgt gctgttgaag 24360 
catctgcagc gccagtgcgc cattatctgc gacgcgttgc aagagcgcag cgatgtgccc 24 420 
ctcgccatag cggatgtcag ccttgcctac gaacgccacc tattctcacc gcgcgtaccc 24480 
cccaaacgcc aagaaaacgg cacatgcgag cccaacccgc gcctcaactt ctaccccgta 24540 
tttgccgtgc cagaggtgct tgccacctat cacatctttt tccaaaactg caagataccc 24 600 
ctatcctgcc gtgccaaccg cagccgagcg gacaagcagc tggccttgcg gcagggcgct 24 660 
gtcatacctg atatcgcctc gctcaacgaa gtgccaaaaa tctttgaggg tcttggacgc 24720 
gacgagaagc gcgcggcaaa cgctctgcaa caggaaaaca gcgaaaatga aagtcactct 24780 
ggagtgttgg tggaactcga gggtgacaac gcgcgcctag ccgtactaaa acgcagcatc 24840 
gaggtcaccc actttgccta cccggcactt aacctacccc ccaaggtcat gagcacagtc 24 900 
atgagtgagc tgatcgtgcg ccgtgcgcag cccctggaga gggatgcaaa tttgcaagaa 24 960 
caaacagagg agggcctacc cgcagttggc gacgagcagc tagcgcgctg gcttcaaacg 25020 
cgcgagcctg ccgacttgga ggagcgacgc aaactaatga tggccgcagt gctcgttacc 25080 
gtggagcttg agtgcatgca gcggttcttt gctgacccgg agatgcagcg caagctagag 25140 
gaaacattgc actacacctt tcgacagggc tacgtacgcc aggcctgcaa gatctccaac 25200 
gtggagctct gcaacctggt ctcctacctt ggaattttgc acgaaaaccg ccttgggcaa 25260 
aacgtgcttc attccacgct caagggcgag gcgcgccgcg actacgtccg cgactgcgtt 25320 
tacttatttc tatgctacac ctggcagacg gccatgggcg tttggcagca gtgcttggag 25380 
gagtgcaacc tcaaggagct gcagaaactg ctaaagcaaa acttgaagga cctatggacg 25440 
gccttcaacg agcgctccgt ggccgcgcac ctggcggaca tcattttccc cgaacgcctg 25500 
cttaaaaccc tgcaacaggg tctgccagac ttcaccagtc aaagcatgtt gcagaacttt 25560 
aggaacttta tcctagagcg ctcaggaatc ttgcccgcca cctgctgtgc acttcctagc 25620 
gactttgtgc ccattaagta ccgcgaatgc cctccgccgc tttggggcca ctgctacctt 25680 
ctgcagctag ccaactacct tgcctaccac tctgacataa tggaagacgt gagcggtgac 25740 
ggtctactgg agtgtcactg tcgctgcaac ctatgcaccc cgcaccgctc cctggtttgc 25800 
aattcgcagc tgcttaacga aagtcaaatt atcggtacct ttgagctgca gggtccctcg 25860 
cctgacgaaa agtccgcggc tccggggttg aaactcactc cggggctgtg gacgtcggct 25920 
taccttcgca aatttgtacc tgaggactac cacgcccacg agattaggtt ctacgaagac 25980 
caatcccgcc cgccaaatgc ggagcttacc gcctgcgtca ttacccaggg ccacattctt 26040 
ggccaattgc aagccatcaa caaagcccgc caagagtttc tgctacgaaa gggacggggg 26100 



Copied from 1020674? on 07-04-2004 



PCT/US00/18971 



gtttacttgg acccccagtc cggcgaggag ctcaacccaa tccccccgcc gccgcagccc 26160 
tatcagcagc agccgcgggc ccttgcttcc caggatggca cccaaaaaga agctgcagct 26220 
gccgccgcca cccacggacg aggaggaata ctgggacagt caggcagagg aggttttgga 26280 
cgaggaggag gaggacatga tggaagactg ggagagccta gacgaggaag cttccgaggt 26340 
cgaagaggtg tcagacgaaa caccgtcacc ctcggtcgca ttcccctcgc cggcgcccca 26400 
gaaatcggca accggttcca gcatggctac aacctccgct cctcaggcgc cgccggcact 264 60 
gcccgttcgc cgacccaacc gtagatggga caccactgga accagggccg gtaagtccaa 26520 
gcagccgccg ccgttagccc aagagcaaca acagcgccaa ggctaccgct catggcgcgg 26580 
gcacaagaac gccatagttg cttgcttgca agactgtggg ggcaacatct ccttcgcccg 26640 
ccgctttctt ctctaccatc acggcgtggc cttcccccgt aacatcctgc attactaccg 26700 
tcatctctac agcccatact gcaccggcgg cagcggcagc ggcagcaaca gcagcggcca 26760 
cacagaagca aaggcgaccg gatagcaaga ctctgacaaa gcccaagaaa tccacagcgg 26820 
cggcagcagc aggaggagga gcgctgcgtc tggcgcccaa cgaacccgta tcgacccgcg 26880 
agcttagaaa caggattttt cccactctgt atgctatatt tcaacagagc aggggccaag 26940 
aacaagagct gaaaataaaa aacaggtctc tgcgatccct cacccgcagc tgcctgtatc 27O00 
acaaaagcga agatcagctt cggcgcacgc tggaagacgc ggaggctctc ttcagtaaat 27060 
actgcgcgct gactcttaag gactagtttc gcgccctttc tcaaatttaa gcgcgaaaac 27120 
tacgtcatct ccagcggcca cacccggcgc cagcacctgt cgtcagcgcc attatgagca 27180 
aggaaattcc cacgccctac atgtggagtt accagccaca aatgggactt gcggctggag 27240 
ctgcccaaga ctactcaacc cgaataaact acatgagcgc gggaccccac atgatatccc 27300 
gggtcaacgg aatccgcgcc caccgaaacc gaattctctt ggaacaggcg gctattacca 27360 
ccacacctcg taataacctt aatccccgta gttggcccgc tgccctggtg taccaggaaa 27420 
gtcccgctcc caccactgtg gtacttccca gagacgccca ggccgaagtt cagatgacta 27480 
actcaggggc gcagcttgcg ggcggctttc gtcacagggt gcggtcgccc gggcagggta 27540 
taactcacct gacaatcaga gggcgaggta ttcagctcaa cgacgagtcg gtgagctcct 27 600 
cgcttggtct ccgtccggac gggacatttc agatcggcgg cgccggccgt ccttcattca 27660 
cgcctcgtca ggcaatccta actctgcaga cctcgtcctc tgagccgcgc tctggaggca 27720 
ttggaactct gcaatttatt gaggagtttg tgccatcggt ctactttaac cccttctcgg 27780 
gacctcccgg ccactatccg gatcaattta ttcctaactt tgacgcggta aaggactcgg 27840 
cggacggcta cgactgaatg ttaagtggag aggcagagca actgcgcctg aaacacctgg 27900 
tccactgtcg ccgccacaag tgctttgccc gcgactccgg tgagttttgc tactttgaat 27960 
tgcccgagga tcatatcgag ggcccggcgc acggcgtccg gcttaccgcc cagggagagc 28020 
ttgcccgtag cctgattcgg gagtttaccc agcgccccct gctagttgag cgggacaggg 28080 
gaccctgtgt tctcactgtg atttgcaact gtcctaacct tggattacat caagatcttt 28140 
gttgccatct ctgtgctgag tataataaat acagaaatta aaatatactg gggctcctat 28200 
cgccatcctg taaacgccac cgtcttcacc cgcccaagca aaccaaggcg aaccttacct 28260 
ggtactttta acatctctcc ctctgtgatt tacaacagtt tcaacccaga cggagtgagt 28320 
ctacgagaga acctctccga gctcagctac tccatcagaa aaaacaccac cctccttacc 28380 
tgccgggaac gtacgagtgc gtcaccggcc gctgcaccac acctaccgcc tgaccgtaaa 28440 
ccagactttt tccggacaga cctcaataac tctgtttacc agaacaggag gtgagcttag 28500 
aaaaccctta gggtattagg ccaaaggcgc agctactgtg gggtttatga acaattcaag 28560 
caactctacg ggctattcta attcaggttt ctctagaagt caggcttcct ggatgtcagc 28620 
atctgacttt ggccagcacc tgtcccgcgg atttgttcca gtccaactac agcgacccac 28 680 
cctaacagag atgaccaaca caaccaacgc ggccgccgct accggactta catctaccac 287 40 
aaatacaccc caagtttctg cctttgtcaa taactgggat aacttgggca tgtggtggtt 28800 
ctccatagcg cttatgtttg tatgccttat tattatgtgg ctcatctgct gcctaaagcg 28860 
caaacgcgcc cgaccaccca tctatagtcc catcattgtg ctacacccaa acaatgatgg 28 920 
aatccataga ttggacggac tgaaacacat gttcttttct cttacagtat gattaaatga 28980 
gatctagaaa tggacggaat tattacagag cagcgcctgc tagaaagacg cagggcagcg 29040 
gccgagcaac agcgcatgaa tcaagagctc caagacatgg ttaacttgca ccagtgcaaa 29100 
aggggtatct tttgtctggt aaagcaggcc aaagtcacct acgacagtaa taccaccgga 29160 
caccgcctta gctacaagtt gccaaccaag cgtcagaaat tggtggtcat ggtgggagaa 29220 
aagcccatta ccataactca gcactcggta gaaaccgaag gctgcattca ctcaccttgt 29280 
caaggacctg aggatctctg cacccttatt aagaccctgt gcggtctcaa agatcttatt 29340 
ccctttaact aataaaaaaa aataataaag catcacttac ttaaaatcag ttagcaaatt 29400 
tctgtccagt ttattcagca gcacctcctt gccctcctcc cagctctggt attgcagctt 294 60 
cctcctggct gcaaactttc tccacaatct aaatggaatg tcagtttcct cctgttcctg 29520 
tccatccgca cccactatct tcatgttgtt gcagatgaag cgcgcaagac cgtctgaaga 29580 
taccttcaac cccgtgtatc catatgacac ggaaaccggt cctccaactg tgccttttct 29640 
tactcctccc tttgtatccc ccaatgggtt tcaagagagt ccccctgggg tactctcttt 29700 
gcgcctatcc gaacctctag ttacctccaa tggcatgctt gcgctcaaaa tgggcaacgg 29760 
cctctctctg gacgaggccg gcaaccttac ctcccaaaat gtaaccactg tgagcccacc 29820 
tctcaaaaaa accaagtcaa acataaacct ggaaatatct gcacccctca cagttacctc 29880 
agaagcccta actgtggctg ccgccgcacc tctaatggtc gcgggcaaca cactcaccat 29940 
gcaatcacag gccccgctaa ccgtgcacga ctccaaactt agcattgcca cccaaggacc 30000 
cctcacagtg tcagaaggaa agctagccct gcaaacatca ggccccctca ccaccaccga 30060 
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tagcagtacc 


cttactatca 


ctgcctcacc 


ccctctaact 


actgccactg 


gtagcttggg 


30120 


cattgacttg 


aaagagccca 


tttatacaca 


aaatggaaaa 


ctaggactaa 


agtacggggc 


30180 


tcctttgcat 


gtaacagacg 


acctaaacac 




gcaactggtc 


caggtgtgac 


30240 


tattaataat 


acttccttgc 


aaactaaagt 


tactggagcc 


ttgggttttg 




30300 


caatatgcaa 


cttaatgtag 


caggaggact 


aaggattgat 


tctcaaaaca 


gacgccttat 


30360 


acttgatgtt 


agttatccgt 


ttgatgctca 


aaaccaacta 




taggacaggg 


30420 


ccctcttttt 


ataaactcag 


cccacaactt 


ggatattaac 


tacaacaaag 




30480 


gtttacagct 


tcaaacaatt 


ccaaaaagct 


tgaggttaac 


ctaagcactg 


ccaaggggtt 


30540 


gatgtttgac 


gctacagcca 


tagccattaa 


tgcaggagat 


gggcttgaat 


ttggttcacc 


30600 


taatgcacca 


aacacaaatc 


ccctcaaaac 


aaaaattggc 


catggcctag 


aatttgattc 


30660 


aaacaaggct 


atggttccta 


aactaggaac 


tggccttagt 


tttgacagca 


caggtgccat 


30720 


tacagtagga 


aacaaaaata 


atgataagct 


aactttgtgg 


accacaccag 


ctccatctcc 


30780 


taactgtaga 


ctaaatgcag 


agaaagatgc 




ttggtcttaa 




30840 


cagtcaaata 


cttgctacag 


tttcagtttt 


ggctgttaaa 


ggcagtttgg 


ctccaatatc 


30900 


tggaacagtt 


caaagtgctc 


atcttattat 


aagatttgac gaaaatggag 


tgctactaaa 


30960 




ctggacccag 


aatattggaa 






ctgaaggcac 


31020 


agcctataca 


aacgctgttg 


gatttatgcc 




gcttatccaa 


aatctcacgg 


31080 


taaaactgcc 


aaaagtaaca 


ttgtcagtca 




aacggagaca 




31140 


tgtaacacta 


accattacac 


taaacggtac 


acaggaaaca 


ggagacacaa 


ctccaagtgc 


31200 


atactctatg 


tcattttcat 


gggactggtc 


tggccacaac 




aaatatttgc 


31260 


cacatcctct 


tacacttttt 


catacattgc 


ccaagaataa 


agaatcgttt 


gtgttatgtt 


31320 


tcaacgtgtt 


tatttttcaa 


ttgcagaaaa 




tttttcattc 


agtagtatag 


31380 


ccccaccacc 


acatagctta 


tacagatcac 




tcaaactcac 


agaaccctag 


31440 




gccacctccc 


tcccaacaca 




agtcctttct 


ccccggctgg 


31500 


ccttaaaaag 


catcatatca 


tgggtaacag 


acatattctt 


aggtgttata 


ttccacacgg 


31560 


tttcctgtcg 


agccaaacgc 


tcatcagtga 


tattaataaa 


ctccccgggc 


agctcactta 


31620 


agttcatgtc 


gctgtccagc 


tgctgagcca 


caggctgctg 


tccaacttgc 


ggttgcttaa 


31680 


cgggcggcga 


aggagaagtc 


cacgcctaca 


tgggggtaga 


gtcataatcg 


tgcatcagga 


31740 


tagggcggtg 


gtgctgcagc 


agcgcgcgaa 


taaactgctg 


ccgccgccgc 


tccgtcctgc 


31800 


aggaatacaa 


catggcagtg 


gtctcctcag 


cgatgattcg 


caccgcccgc 


agcataaggc 


31860 


gccttgtcct 


ccgggcacag 


cagcgcaccc 


tgatctcact 


taaatcagca 


cagtaactgc 


31920 


agcacagcac 


cacaatattg 




cacagtgcaa 


ggcgctgtat 


ccaaagctca 


31980 


tggcggggac 


cacagaaccc 


acgtggccat 




gcgcaggtag 


attaagtggc 


32040 


gacccctcat 


aaacacgctg 


gacataaaca 


ttacctcttt 


tggcatgttg 




32100 


cctcccggta 


ccatataaac 


ctctgattaa 


acatggcgcc 




atcctaaacc 


32160 


agctggccaa 


aacctgcccg 


ccggctatac 


actgcaggga 


accgggactg 


gaacaatgac 


32220 


agtggagagc 


ccaggactcg 


taaccatgga 


tcatcatgct 


cgtcatgata 


tcaatgttgg 


32280 


cacaacacag 


gcacacgtgc 




tcaggattac 


aagctcctcc 


cgcgttagaa 


32340 


ccatatccca 


gggaacaacc 


cattcctgaa 


tcagcgtaaa 


tcccacactg 


cagggaagac 


32400 


ctcgcacgta 


actcacgttg 


tgcattgtca 


aagtgttaca 


ttcgggcagc 


agcggatgat 


32460 


cctccagtat ggtagcgcgg gtttctgtct caaaaggagg tagacgatcc 


ctactgtacg 


32520 


gagtgcgccg 


agacaaccga 


gatcgtgttg 


gtcgtagtgt 


catgccaaat 


ggaacgccgg 


32580 


acgtagtcat 


atttcctgaa 


gcaaaaccag gtgcgggcgt gacaaacaga 


tctgcgtctc 


32640 


cggtctcgcc gcttagatcg ctctgtgtag tagttgtagt 


atatccactc 


tctcaaagca 


32700 


tccaggcgcc 


ccctggcttc 


gggttctatg 


taaactcctt 


catgcgccgc 


tgccctgata 


32760 




ccgcagaata 


agccacaccc 


agccaaccta cacattcgtt 


ctgcgagtca 


32820 


cacacgggag 


gagcgggaag 


agctggaaga 


accatgtttt 


tttttttatt 


ccaaaagatt 


32880 


atccaaaacc 


tcaaaatgaa 


gatctattaa 


gtgaacgcgc 


tcccctccgg 


tggcgtggtc 


32940 




gccaaagaac 


agataatggc 


atttgtaaga 


tgttgcacaa 


tggcttccaa 


33000 


aaggcaaacg 


gccctcacgt 


ccaagtggac 


gtaaaggcta aacccttcag 


ggtgaatctc 


33060 




attccagcac 


cttcaaccat 


gcccaaataa 




gccaccttct 


33120 




ctaagcaaat 


cccgaatatt 


aagtccggcc 


attgtaaaaa 


tctgctccag 


33180 


agcgccctcc 


accttcagcc 


tcaagcagcg 


aatcatgatt 


gcaaaaattc 


aggttcctca 


33240 


cagacctgta 


taagattcaa 


aagcggaaca 


ttaacaaaaa 


taccgcgatc 


ccgtaggtcc 


33300 


cttcgcaggg 


ccagctgaac 


ataatcgtgc 


aggtctgcac 


ggaccagcgc 


ggccacttcc 


33360 


ccgccaggaa 


ccttgacaaa 


agaacccaca 


ctgattatga cacgcatact 


cggagctatg 


33420 


ctaaccagcg 


tagccccgat 


gtaagctttg 


ttgcatgggc ggcgatataa 


aatgcaaggt 


33480 


gctgctcaaa 




aagcctcgcg 


caaaaaagaa 


agcacatcgt 


agtcatgctc 


33540 


atgcagataa 


aggcaggtaa 


gctccggaac 


caccacagaa 


aaagacacca 


tttttctctc 


33600 


aaacatgtct 


gcgggtttct 






aacaaaaaaa 


catttaaaca 


33660 


ttagaagcct 




aggaaaaaca 




gcataagacg 


gactacggcc 


33720 


atgccggcgt 


jaccgtaaaa 


aaactggtca 


ccgtgattaa 


aaagcaccac 


cgacagctcc 


33780 


tcggtcatgt 


ccggagtcat 


aatgtaagac 


tcggtaaaca 


catcaggttg 


attcatcggt 


33840 


cagtgctaaa 


aagcgaccga 


aatagcccgg 


gggaatacat 


acccgcaggc 


gtagagacaa 


33900 


cattacagcc 


cccataggag 


gtataacaaa 


attaatagga 


gagaaaaaca 




33960 


tgaaaaaccc tcctgcctag gcaaaatagc accctcccgc tccagaacaa 


catacagcgc 


34020 
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ccagtaaaaa agaaaaccta ttaaaaaaac 34080 

cacagtgtaa aaaagggcca agtgcagagc 34140 

cggttaaagt ccacaaaaaa cacccagaaa 34200 

agccaaaaaa cccacaactt cctcaaatcg 34260 

cccattttaa gaaaactaca attcccaaca 34320 

tcacccgccc cgttcccacg ccccgcgcca 34380 

ttggcttcaa tccaaaataa ggtatattat 34440 
34448 

<210> 5 
<211> 94 
<212> PRT 

<213> Adenovirus subgroup C 



ttcacagcgg 

gagtatatat 
accgcacgcg 
tcacttccgt 

cgtcacaaac 
tgatgatg 



cagcctaaca 
acggcaccag 
aggactaaaa 

tttcccacgt 

tccaccccct 



ctcaatcagt 
aatgacgtaa 
cagaaacgaa 
tacgtaactt 



<400> 5 

Met Val Asp Thr Val Asn Ser Tyr Asn Thr Ala Thr Gly Leu Thr Ser 
15 10 15 

Ala Leu Asn Leu Pro Gin Val Ser Thr Phe Val Asn Asn Trp Ala Asn 
20 25 30 



Leu Gly Met TrD Trp Phe Ser lie 
35 40 

He Met Trp Leu Ser Cys Cys Leu 
50 55 

He Tyr Lys Pro He He Val Leu 
65 70 

Arg Leu Asp Gly Leu Asn Thr Cys 
85 



Ala Leu Met Phe Val Cys Leu He 
45 

Lys Arg Lys Arg Ala Arg Pro Pro 
60 

Asn Pro Asn Asn Asp Gly He His 
75 80 

Ser Phe Ser Phe Ala Val 
90 



<210> 6 
<2H> 101 
<212> PRT 

<213> Adenovirus subgroup C 



<400> 6 

Met Thr Glv Ser Thr He Ala Pro Thr Thr Asp Tyr Arg Asn Thr Thr 
15 10 15 

Ala Thr Gly Leu Thr Ser Ala Leu Asn Leu Pro Gin Val His Ala Phe 
20 25 30 



Val Asn Asp Trp Ala Ser Leu Asp 
35 40 

Met Phe Val Cys Leu He He Met 
50 55 

Arg Arg Ala Arg Pro Pro He Tyr 
65 70 

His Asn Glu Lys He His Arg Leu 
85 



Met Trp Trp Phe Ser He Ala Leu 
45 

Trp Leu He Cys Cys Leu Lys Arg 
60 

Arg Pro He He Val Leu Asn Pro 
75 80 

Asp Gly Leu Lys Pro Cys Ser Leu 
90 95 



Leu Leu Gin Tyr Asp 
100 



<210> 7 
<211> 93 
<212> PRT 

<213> Adenovirus subgroup C 
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<400> 7 

Met Thr Asn Thr Thr Asn Ala Ala 
1 5 

Thr Asn Thr Pro Gin Val Ser Ala 
20 

Gly Met Trp Trp Phe Ser lie Ala 

35 40 

Met Trp Leu lie Cys Cys Leu Lys 
50 55 

Tyr Ser Pro He He Val Leu His 
65 7C 

Leu Asp Gly Leu Lys His Met Phe 
85 



Ala Ala Thr Gly Leu Thr Ser Thr 
10 15 



Phe Val Asn Asn Trp Asp Asn Leu 
25 30 



Leu Met Phe Val Cys Leu He He 
45 



Arg Lys Arg Ala Arg Pro Pro He 
60 



Pro Asn Asn Asp Gly He His Arg 
75 80 



Phe Ser Leu Thr Val 
90 



<210> 8 
<211> 95 
<212> PRT 

<213> Adenovirus subgroup C 
<400> 8 

Met Val Asp Thr Val Asn Ser Tyr Asn Thr Ala Thr Gly Leu Lys Ser 
1 5 10 15 

Ala Leu Asn Leu Pro Gin Val His Ala Phe Val Asn Asp Trp Ala Ser 
20 25 30 

Leu Gly Met Trp Trp Phe Ser He Ala Leu Met Phe Val Cys Leu He 
35 40 45 

He Met Trp Leu He Cys Cys Leu Lys Arg Arg Arg Ala Arg Pro Pro 
50 55 60 

He Tyr Arg Pro He He Val Leu Asn Pro His Asn Glu Lys He His 



Arg Leu Asp Gly Leu Lys Pro Cys Ser Leu Leu Leu Gin Tyr Asp 
85 90 95 



<210> 9 
<211> 78 
<212> PRT 

<213> Adenovirus subgroup C 
<400> 9 

Met Thr Gly Ser Thr He Ala Pro Thr Thr Asp Tyr Arg Asn Thr Thr 
15 10 15 

Ala Thr Gly Leu Thr Ser Ala Leu Asn Leu Pro Gin Val His Ala Phe 
20 25 30 

Val Asn Asp Trp Ala Ser Leu Asp Met Trp Trp Phe Ser He Ala Leu 
35 40 45 

Met Phe Val Cys Leu He He Met Trp Leu He Cys Cys Leu Lys Arg 
50 55 60 

Arg Arg Ala Arg Pro Pro He Tyr Arg Pro He He Val Leu 
65 70 75 
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<210> 10 
<211> 87 
<212> PRT 

<213> Adenovirus subgroup C 
<400> 10 

Met Thr Gly Ser Thr lie Ala Pro Thr Thr Asp Tyr Arg Asn Thr Thr 
15 10 15 

Ala Thr Gly Leu Thr Ser Ala Leu Asn Leu Pro Gin Val His Ala Phe 
20 25 30 

Val Asn Asp Trp Ala Ser Leu Asp Met Trp Trp Phe Ser He Ala Leu 
35 40 45 

Met Phe Val Cys Leu He He Met Trp Leu He Cys Cys Leu Lys Arg 
50 55 60 

Arg Arg Ala Arg Pro Pro He Tyr Arg Pro He Gly Leu Lys Pro Cys 
65 70 75 80 

Ser Leu Leu Leu Gin Tyr Asp 
85 



<210> 11 
<211> 77 
<212> PRT 

<213> Adenovirus subgroup C 
<400> 11 

Met Thr Gly Ser Thr He Ala Pro Thr Thr Asp Tyr Arg Asn Thr Thr 
1 5 10 15 

Ala Thr Gly Leu Thr Ser Ala Leu Asn Leu Pro Gin Val His Ala Phe 
20 25 30 

Val Asn Asp Trp Ala Ser Leu Asp Met Trp Trp Phe Ser He Ala Leu 
35 40 45 

Met Phe Val Cys Leu He He Met Trp Leu He Cys Cys Leu Lys Arg 
50 55 60 

Arg Arg Ala Arg Pro Pro Ser Leu Leu Leu Gin Tyr Asp 
65 ' 70 75 



<210> 12 
<211> 84 
<212> PRT 

<213> Adenovirus subgroup C 
<400> 12 

Met Thr Gly Ser Thr He Ala Pro Thr Thr Asp Tyr Arg Asn Thr Thr 
15 10 15 

Ala Thr Gly Leu Thr Ser Ala Leu Asn Leu Pro Gin He Ala Leu Met 
20 25 30 

Phe Val Cys Leu He He Met Trp Leu He Cys Cys Leu Lys Arg Arg 
35 40 45 

Arg Ala Arg Pro Pro He Tyr Arg Pro He He Val Leu Asn Pro His 
50 55 60 
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Asn Glu Lys lie His Arg Leu Asp Gly Leu Lys Pro Cys Ser Leu Leu 
65 70 75 80 

Leu Gin Tyr Asp 



<210> 13 
<211> 35724 
<212> DNA 

<213> Adenovirus subgroup C 
<400> 13 

catcatcaat aatatacctt attttggatt gaagccaata tgataatgag ggggtggagt 60 
ttgtgacgtg gcgcggggcg tgggaacggg gcgggtgacg tagtagtgtg gcggaagtgt 120 
gatgttgcaa gtgtggcgga acacatgtaa gcgacggatg tggcaaaagt gacgtttttg 180 
gtgtgcgccg gtgtacacag gaagtgacaa ttttcgcgcg gttttaggcg gatgttgtag 240 
taaatttggg cgtaaccgag taagatttgg ccattttcgc gggaaaactg aataagagga 300 
agtgaaatct gaataatttt gtgttactca tagcgcgtaa tatttgtcta gggccgcggg 360 
gactttgacc gtttacgtgg agactcgccc aggtgttttt ctcaggtgtt ttccgcgttc 420 
cgggtcaaag ttggcgtttt attattatag tcagctgacg tgtagtgtat ttatacccgg 480 
tgagttcctc aagaggccac tcttgagtgc cagcgagtag agttttctcc tccgagccgc 540 
tccgacaccg ggactgaaaa tgagacatga ggtactggct gataatcttc cacctcctag 600 
ccattttgaa ccacctaccc ttcacgaact gtatgattta gacgtgacgg cccccgaaga 660 
tcccaacgag gaggcggttt cgcagatttt tcccgactct gtaatgttgg cggtgcagga 720 
agggattgac ttactcactt ttccgccggc gcccggttct ccggagccgc ctcacctttc 780 
ccggcagccc gagcagccgg agcagagagc cttgggtccg gtttgccacg aggctggctt 84 0 
tccacccagt gacgacgagg atgaagaggg tgaggagttt gtgttagatt atgtggagca 900 
ccccgggcac ggttgcaggt cttgtcatta tcaccggagg aatacggggg acccagatat 960 
tatgtgttcg ctttgctata tgaggacctg tggcatgttt gtctacagta agtgaaaatt 1020 
atgggcagtg ggtgatagag tggtgggttt ggtgtggtaa tttttttttt aatttttaca 1080 
gttttgtggt ttaaagaatt ttgtattgtg atttttttaa aaggtcctgt gtctgaacct 1140 
gagcctgagc ccgagccaga accggagcct gcaagaccta cccgccgtcc taaaatggcg 1200 
cctgctatcc tgagacgccc gacatcacct gtgtctagag aatgcaatag tagtacggat 1260 
agctgtgact ccggtccttc taacacacct cctgagatac acccggtggt cccgctgtgc 1320 
cccattaaac cagttgccgt gagagttggt gggcgtcgcc aggctgtgga atgtatcgag 1380 
gacttgctta acgagcctgg gcaacctttg gacttgagct gtaaacgccc caggccataa 1440 
ggtgtaaacc tgtgattgcg tgtgtggtta acgcctttgt ttgctgaatg agttgatgta 1500 
agtttaataa agggtgagat aatgtttaac ttgcatggcg tgttaaatgg ggcggggctt 1560 
aaagggtata taatgcgccg tgggctaatc ttggttacat ctgacctcat ggaggcttgg 1620 
gagtgtttgg aagatttttc tgctgtgcgt aacttgctgg aacagagctc taacagtacc 1680 
tcttggtttt ggaggtttct gtggggctca tcccaggcaa agttagtctg cagaattaag 1740 
gaggattaca agtgggaatt tgaagagctt ttgaaatcct gtggtgagct gtttgattct 1800 
ttgaatctgg gtcaccaggc gcttttccaa gagaaggtca tcaagacttt ggatttttcc 1860 
acaccggggc gcgctgcggc tgctgttgct tttttgagtt ttataaagga taaatggagc 1920 
gaagaaaccc atctgagcgg ggggtacctg ctggattttc tggccatgca tctgtggaga 1980 
gcggttgtga gacacaagaa tcgcctgcta ctgttgtctt ccgtccgccc ggcgataata 2040 
ccgacggagg agcagcagca gcagcaggag gaagccaggc ggcggcggca ggagcagagc 2100 
ccatggaacc cgagagccgg cctggaccct cgggaatgaa tgttgtacag gtggctgaac 2160 
tgtatccaga actgagacgc attttgacaa ttacagagga tgggcagggg ctaaaggggg 2220 
taaagaggga gcggggggct tgtgaggcta cagaggaggc taggaatcta gcttttagct 2280 
taatgaccag acaccgtcct gagtgtatta cttttcaaca gatcaaggat aattgcgcta 2340 
atgagcttga tctgctggcg cagaagtatt ccatagagca gctgaccact tactggctgc 2400 
agccagggga tgattttgag gaggctatta gggtatatgc aaaggtggca cttaggccag 2460 
attgcaagta caagatcagc aaacttgtaa atatcaggaa ttgttgctac atttctggga 2520 
acggggccga ggtggagata gatacggagg atagggtggc ctttagatgt agcatgataa 2580 
atatgtggcc gggggtgctt ggcatggacg gggtggttat tatgaatgta aggtttactg 2640 
gccccaattt tagcggtacg gttttcctgg ccaataccaa ccttatccta cacggtgtaa 2700 
gcttctatgg gtttaacaat acctgtgtgg aagcctggac cgatgtaagg gttcggggct 2760 
gtgcctttta ctgctgctgg aagggggtgg tgtgtcgccc caaaagcagg gcttcaatta 2820 
agaaatgcct ctttgaaagg tgtaccttgg gtatcctgtc tgagggtaac tccagggtgc 2880 
gccacaatgt ggcctccgac tgtggttgct tcatgctagt gaaaagcgtg gctgtgatta 294 0 
agcataacat ggtatgtggc aactgcgagg acagggcctc tcagatgctg acctgctcgg 3000 
acggcaactg tcacctgctg aagaccattc acgtagccag ccactctcgc aaggcctggc 3060 
cagtgtttga gcataacata ctgacccgct gttccttgca tttgggtaac aggagggggg 3120 
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tgttcctacc ttaccaatgc aatttgagtc acactaagat attgcttgag cccgagagca 3180 
tgtccaaggt gaacctgaac ggggtgtttg acatgaccat gaagatctgg aaggtgctga 3240 
ggtacgatga gacccgcacc aggtgcagac cctgcgagtg tggcggtaaa catattagga 3300 
accagcctgt gatgctggat gtgaccgagg agctgaggcc cgatcacttg gtgctggcct 33 60 
gcacccgcgc tgagtttggc tctagcgatg aagatacaga ttgaggtact gaaatgtgtg 3420 
gccgtggctt aagggtggga aagaatatat aaggtggggg tcttatgtag ttttgtatct 3480 
gttttgcagc agccgccgcc gccatgagca ccaactcgtt tgatggaagc attgtgagct 3540 
catatttgac aacgcgcatg cccccatggg ccggggtgcg tcagaatgtg atgggctcca 3600 
gcattgatgg tcgccccgtc ctgcccgcaa actctactac cttgacctac gagaccgtgt 3660 
ctggaacgcc gttggagact gcagcctccg ccgccgcttc agccgctgca gccaccgccc 3720 
gcgggattgt gactgacttt gctttcctga gcccgcttgc aagcagtgca gcttcccgtt 3180 
catccgcccg cgatgacaag ttgacggctc ttttggcaca attggattct ttgacccggg 3840 
aacttaatgt cgtttctcag cagctgttgg atctgcgcca gcaggtttct gccctgaagg 3900 
cttcctcccc tcccaatgcg gtttaaaaca taaataaaaa accagactct gtttggattt 3960 
ggatcaagca agtgtcttgc tgtctttatt taggggtttt gcgcgcgcgg taggcccggg 4020 
accagcggtc tcggtcgttg agggtcctgt gtattttttc caggacgtgg taaaggtgac 4080 
tctggatgtt cagatacatg ggcataagcc cgtctctggg gtggaggtag caccactgca 4140 
gagcttcatg ctgcggggtg gtgttgtaga tgatccagtc gtagcaggag cgctgggcgt 4200 
ggtgcctaaa aatgtctttc agtagcaagc tgattgccag gggcaggccc ttggtgtaag 4260 
tgtttacaaa gcggttaagc tgggatgggt gcatacgtgg ggatatgaga tgcatcttgg 4 320 
actgtatttt taggttggct atgttcccag ccatatccct ccggggattc atgttgtgca 4380 
gaaccaccag cacagtgtat ccggtgcact tgggaaattt gtcatgtagc ttagaaggaa 4 440 
atgcgtggaa gaacttggag acgcccttgt gacctccaag attttccatg cattcgtcca 4 500 
taatgatggc aatgggccca cgggcggcgg cctgggcgaa gatatttctg ggatcactaa 4560 
cgtcatagtt gtgttccagg atgagatcgt cataggccat ttttacaaag cgcgggcgga 4 620 
gggtgccaga ctgcggtata atggttccat ccggcccagg ggcgtagtta ccctcacaga 4 680 
tttgcatttc ccacgctttg agttcagatg gggggatcat gtctacctgc ggggcgatga 4 74 0 
agaaaacggt ttccggggta ggggagatca gctgggaaga aagcaggttc ctgagcagct 4 800 
gcgacttacc gcagccggtg ggcccgtaaa tcacacctat taccgggtgc aactggtagt 4860 
taagagagct gcagctgccg tcatccctga gcaggggggc cacttcgtta agcatgtccc 4 920 
tgactcgcat gttttccctg accaaatccg ccagaaggcg ctcgccgccc agcgatagca 4 980 
gttcttgcaa ggaagcaaag tttttcaacg gtttgagacc gtccgccgta ggcatgcttt 504 0 
tgagcgtttg accaagcagt tccaggcggt cccacagctc ggtcacctgc tctacggcat 5100 
ctcgatccag catatctcct cgtttcgcgg gttggggcgg ctttcgctgt acggcagtag 5160 
tcggtgctcg tccagacggg ccagggtcat gtctttccac gggcgcaggg tcctcgtcag 5220 
cgtagtctgg gtcacggtga aggggtgcgc tccgggctgc gcgctggcca gggtgcgctt 5280 
gaggctggtc ctgctggtgc tgaagcgctg ccggtcttcg ccctgcgcgt cggccaggta 534 0 
gcatttgacc atggtgtcat agtccagccc ctccgcggcg tggcccttgg cgcgcagctt 5400 
gcccttggag gaggcgccgc acgaggggca gtgcagactt ttgagggcgt agagcttggg 54 60 
cgcgagaaat accgattccg gggagtaggc atccgcgccg caggccccgc agacggtctc 5520 
gcattccacg agccaggtga gctctggccg ttcggggtca aaaaccaggt ttcccccatg 5580 
ctttttgatg cgtttcttac ctctggtttc catgagccgg tgtccacgct cggtgacgaa 5640 
aaggctgtcc gtgtccccgt atacagactt gagaggcctg tcctcgagcg gtgttccgcg 5700 
gtcctcctcg tatagaaact cggaccactc tgagacaaag gctcgcgtcc aggccagcac 5760 
gaaggaggct aagtgggagg ggtagcggtc gttgtccact agggggtcca ctcgctccag 5820 
ggtgtgaaga cacatgtcgc cctcttcggc atcaaggaag gtgattggtt tgtaggtgta 5880 
ggccacgtga ccgggtgttc ctgaaggggg gctataaaag ggggtggggg cgcgttcgtc 5940 
ctcactctct tccgcatcgc tgtctgcgag ggccagctgt tggggtgagt actccctctg 6000 
aaaagcgggc atgacttctg cgctaagatt gtcagtttcc aaaaacgagg aggatttgat 6060 
attcacctgg cccgcggtga tgcctttgag ggtggccgca tccatctggt cagaaaagac 6120 
aatctttttg ttgtcaagct tggtggcaaa cgacccgtag agggcgttgg acagcaactt 6180 
ggcgatggag cgcagggttt ggtttttgtc gcgatcggcg cgctccttgg ccgcgatgtt 624 0 
tagctgcacg tattcgcgcg caacgcaccg ccattcggga aagacggtgg tgcgctcgtc 6300 
gggcaccagg tgcacgcgcc aaccgcggtt gtgcagggtg acaaggtcaa cgctggtggc 6360 
tacctctccg cgtaggcgct cgttggtcca gcagaggcgg ccgcccttgc gcgagcagaa 6420 
tggcggtagg gggtctagct gcgtctcgtc cggggggtct gcgtccacgg taaagacccc 6480 
gggcagcagg cgcgcgtcga agtagtctat cttgcatcct tgcaagtcta gcgcctgctg 6540 
ccatgcgcgg gcggcaagcg cgcgctcgta tgggttgagt gggggacccc atggcatggg 6600 
gtgggtgagc gcggaggcgt acatgccgca aatgtcgtaa acgtagaggg gctctctgag 6660 
tattccaaga tatgtagggt agcatcttcc accgcggatg ctggcgcgca cgtaatcgta 6720 
tagttcgtgc gagggagcga ggaggtcggg accgaggttg ctacgggcgg gctgctctgc 6780 
tcggaagact atctgcctga agatggcatg tgagttggat gatatggttg gacgctggaa 684 0 
gacgttgaag ctggcgtctg tgagacctac cgcgtcacgc acgaaggagg cgtaggagtc 6900 
gcgcagcttg ttgaccagct cggcggtgac ctgcacgtct agggcgcagt agtccagggt 6960 
ttccttgatg atgtcatact tatcctgtcc cttttttttc cacagctcgc ggttgaggac 7020 
aaactcttcg cggtctttcc agtactcttg gatcggaaac ccgtcggcct ccgaacggta 7080 
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agagcctagc atgtagaact ggttgacggc ctggtaggcg cagcatccct tttctacggg 7140 
tagcgcgtat gcctgcgcgg ccttccggag cgaggtgtgg gtgagcgcaa aggtgtccct 7200 
gaccatgact ttgaggtact ggtatttgaa gtcagtgtcg tcgcatccgc cctgctccca 7260 
gagcaaaaag tccgtgcgct ttttggaacg cggatttggc agggcgaagg tgacatcgtt 7320 
gaagagtatc tttcccgcgc gaggcataaa gttgcgtgtg atgcggaagg gtcccggcac 7380 
ctcggaacgg ttgttaatta cctgggcggc gagcacgatc tcgtcaaagc cgttgatgtt 7440 
gtggcccaca atgtaaagtt ccaagaagcg cgggatgccc ttgatggaag gcaatttttt 7500 
aagttcctcg taggtgagct cttcagggga gctgagcccg tgctctgaaa gggcccagtc 7560 
tgcaagatga gggttggaag cgacgaatga gctccacagg tcacgggcca ttagcatttg 7 620 
caggtggtcg cgaaaggtcc taaactggcg acctatggcc attttttctg gggtgatgca 7680 
gtagaaggta agcgggtctt gttcccagcg gtcccatcca aggttcgcgg ctaggtctcg 7740 
cgcggcagtc actagaggct catctccgcc gaacttcatg accagcatga agggcacgag 7800 
ctgcttccca aaggccccca tccaagtata ggtctctaca tcgtaggtga caaagagacg 7860 
ctcggtgcga ggatgcgagc cgatcgggaa gaactggatc tcccgccacc aattggagga 7920 
gtggctattg atgtggtgaa agtagaagtc cctgcgacgg gccgaacact cgtgctggct 7980 
tttgtaaaaa cgtgcgcagt actggcagcg gtgcacgggc tgtacatcct gcacgaggtt 8040 
gacctgacga ccgcgcacaa ggaagcagag tgggaatttg agcccctcgc ctggcgggtt 8100 
tggctggtgg tcttctactt cggctgcttg tccttgaccg tctggctgct cgaggggagt 8160 
tacggtggat cggaccacca cgccgcgcga gcccaaagtc cagatgtccg cgcgcggcgg 8220 
tcggagcttg atgacaacat cgcgcagatg ggagctgtcc atggtctgga gctcccgcgg 8280 
cgtcaggtca ggcgggagct cctgcaggtt tacctcgcat agacgggtca gggcgcgggc 8340 
tagatccagg tgatacctaa tttccagggg ctggttggtg gcggcgtcga tggcttgcaa 8400 
gaggccgcat ccccgcggcg cgactacggt accgcgcggc gggcggtggg ccgcgggggt 84 60 
gtccttggat gatgcatcta aaagcggtga cgcgggcgag cccccggagg tagggggggc 8520 
tccggacccg ccgggagagg gggcaggggc acgtcggcgc cgcgcgcggg caggagctgg 8580 
tgctgcgcgc gtaggttgct ggcgaacgcg acgacgcggc ggttgatctc ctgaatctgg 8640 
cgcctctgcg tgaagacgac gggcccggtg agcttgagcc tgaaagagag ttcgacagaa 8700 
tcaatttcgg tgtcgttgac ggcggcctgg cgcaaaatct cctgcacgtc tcctgagttg 8760 
tcttgatagg cgatctcggc catgaactgc tcgatctctt cctcctggag atctccgcgt 8820 
ccggctcgct ccacggtggc ggcgaggtcg ttggaaatgc gggccatgag ctgcgagaag 8880 
gcgttgaggc ctccctcgtt ccagacgcgg ctgtagacca cgcccccttc ggcatcgcgg 8940 
gcgcgcatga ccacctgcgc gagattgagc tccacgtgcc gggcgaagac ggcgtagttt 9000 
cgcaggcgct gaaagaggta gttgagggtg gtggcggtgt gttctgccac gaagaagtac 9060 
ataacccagc gtcgcaacgt ggattcgttg atatccccca aggcctcaag gcgctccatg 9120 
gcctcgtaga agtccacggc gaagttgaaa aactgggagt tgcgcgccga cacggttaac 9180 
tcctcctcca gaagacggat gagctcggcg acagtgtcgc gcacctcgcg ctcaaaggct 9240 
acaggggcct cttcttcttc ttcaatctcc tcttccataa gggcctcccc ttcttcttct 9300 
tctggcggcg gtgggggagg ggggacacgg cggcgacgac ggcgcaccgg gaggcggtcg 9360 
acaaagcgct cgatcatctc cccgcggcga cggcgcatgg tctcggtgac ggcgcggccg 9420 
ttctcgcggg ggcgcagttg gaagacgccg cccgtcatgt cccggttatg ggttggcggg 9480 
gggctgccat gcggcaggga tacggcgcta acgatgcatc tcaacaattg ttgtgtaggt 9540 
actccgccgc cgagggacct gagcgagtcc gcatcgaccg gatcggaaaa cctctcgaga 9600 
aaggcgtcta accagtcaca gtcgcaaggt aggctgagca ccgtggcggg cggcagcggg 9660 
cggcggtcgg ggttgtttct ggcggaggtg ctgctgatga tgtaattaaa gtaggcggtc 9720 
ttgagacggc ggatggtcga cagaagcacc atgtccttgg gtccggcctg ctgaatgcgc 9780 
aggcggtcgg ccatgcccca ggcttcgttt tgacatcggc gcaggtcttt gtagtagtct 9840 
tgcatgagcc tttctaccgg cacttcttct tctccttcct cttgtcctgc atctcttgca 9900 
tctatcgctg cggcggcggc ggagtttggc cgtaggtggc gccctcttcc tcccatgcgt 9960 
gtgaccccga agcccctcat cggctgaagc agggctaggt cggcgacaac gcgctcggct 10020 
aatatggcct gctgcacctg cgtgagggta gactggaagt catccatgtc cacaaagcgg 10080 
tggtatgcgc ccgtgttgat ggtgtaagtg cagttggcca taacggacca gttaacggtc 10140 
tggtgacccg gctgcgagag ctcggtgtac ctgagacgcg agtaagccct cgagtcaaat 10200 
acgtagtcgt tgcaagtccg caccaggtac tggtatccca ccaaaaagtg cggcggcggc 10260 
tggcggtaga ggggccagcg tagggtggcc ggggctccgg gggcgagatc ttccaacata 10320 
aggcgatgat atccgtagat gtacctggac atccaggtga tgccggcggc ggtggtggag 10380 
gcgcgcggaa agtcgcggac gcggttccag atgttgcgca gcggcaaaaa gtgctccatg 10440 
gtcgggacgc tctggccggt caggcgcgcg caatcgttga cgctctagcg tgcaaaagga 10500 
gagcctgtaa gcgggcactc ttccgtggtc tggtggataa attcgcaagg gtatcatggc 10560 
ggacgaccgg ggttcgagcc ccgtatccgg ccgtccgccg tgatccatgc ggttaccgcc 10620 
cgcgtgtcga acccaggtgt gcgacgtcag acaacggggg agtgctcctt ttggcttcct 10680 
tccaggcgcg gcggctgctg cgctagcttt tttggccact ggccgcgcgc agcgtaagcg 10740 
gttaggctgg aaagcgaaag cattaagtgg ctcgctccct gtagccggag ggttattttc 10800 
caagggttga gtcgcgggac ccccggttcg agtctcggac cggccggact gcggcgaacg 10860 
ggggtttgcc tccccgtcat gcaagacccc gcttgcaaat tcctccggaa acagggacga 10920 
gccccttttt tgcttttccc agatgcatcc ggtgctgcgg cagatgcgcc cccctcctca 10980 
gcagcggcaa gagcaagagc agcggcagac atgcagggca ccctcccctc ctcctaccgc 11040 
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gtcaggaggg gcgacatccg cggttgacgc 
gcgccgggcc cggcactacc tggacttgga 
gccctctcct gagcggtacc caagggtgca 
gccgcggcag aacctgtttc gcgaccgcga 
aaagttccac gcagggcgcg agctgcggca 
ggaggacttt gagcccgacg cgcgaaccgg 
cgccgacctg gtaaccgcat acgagcagac 
ctttaacaac cacgtgcgta cgcttgtggc 
tctgtgggac tttgtaagcg cgctggagca 
gctgttcctt atagtgcagc acagcaggga 
catagtagag cccgagggcc gctggctgct 
ggtgcaggag cgcagcttga gcctggctga 
tagcctgggc aagttttacg cccgcaagat 
ggaggtaaag atcgaggggt tctacatgcg 
cgacctgggc gtttatcgca acgagcgcat 
cgagctcagc gaccgcgagc tgatgcacag 
cggcgataga gaggccgagt cctactttga 
ccgacgcgcc ctggaggcag ctggggccgg 
tggcaacgtc ggcggcgtgg aggaatatga 
cgagtactaa gcggtgatgt ttctgatcag 
cgggcggcgc tgcagagcca gccgtccggc 
atggaccgca tcatgtcgct gactgcgcgc 
gccaaccggc tctccgcaat tctggaagcg 
gagaaggtgc tggcgatcgt aaacgcgctg 
gccggcctgg tctacgacgc gctgcttcag 
cagaccaacc tggaccggct ggtgggggat 
gcgcagcagc agggcaacct gggctccatg 
cccgccaacg tgccgcgggg acaggaggac 
atggtgactg agacaccgca aagtgaggtg 
accagtagac aaggcctgca gaccgtaaac 
ctgtgggggg tgcgggctcc cacaggcgac 
aactcgcgcc tgttgctgct gctaatagcg 
gacacatacc taggtcactt gctgacactg 
gacgagcata ctttccagga gattacaagt 
ggcagcctgg aggcaaccct aaactacctg 
ttgcacagtt taaacagcga ggaggagcgc 
cttaacctga tgcgcgacgg ggtaacgccc 
atggaaccgg gcatgtatgc ctcaaaccgg 
catcgcgcgg ccgccgtgaa ccccgagtat 
ctaccgcccc ctggtttcta caccggggga 
ctctgggacg acatagacga cagcgtgttt 
caacagcgcg agcaggcaga ggcggcgctg 
ttgtccgatc taggcgctgc ggccccgcgg 
atagggtccc ttaccagcac tcgcaccacc 
ctaaacaact cgctgctgca gccgcagcgc 
aacgggatag agagcctagt ggacaagatg 
agggacgtgc caggcccgcg cccgcccacc 
ctggtgtggg aggacgatga ctcggcagac 
ggcaacccgt ttgcgcacct tcgccccagg 
atgatgcaaa ataaaaaact caccaaggcc 
cccttagtat gcggcgcgcg gcgatgtatg 
tggtgagcgc ggcgccagtg gcggcggcgc 
cgccgtttgt gcctccgcgg tacctgcggc 
ctgagttggc acccctattc gacaccaccc 
atgtggcatc cctgaactac cagaacgacc 
acaatgacta cagcccgggg gaggcaagca 
actggggcgg cgacctgaaa accatcctgc 
tgtttaccaa taagtttaag gcgcgggtga 
aggtggagct gaaatacgag tgggtggagt 
ccatgaccat agaccttatg aacaacgcga 
agaacggggt tctggaaagc gacatcgggg 
ggtttgaccc cgtcactggt cttgtcatgc 
cagacatcat tttgctgcca ggatgcgggg 
tgttgggcat ccgcaagcgg caacccttcc 
tggagggtgg taacattccc gcactgttgg 
atgacaccga acagggcggg ggtggcgcag 



ggcagcagat ggtgattacg aacccccgcg 11100 
ggagggcgag ggcctggcgc ggctaggagc 11160 
gctgaagcgt gatacgcgtg aggcgtacgt 11220 
gggagaggag cccgaggaga tgcgggatcg 11280 
tggcctgaat cgcgagcggt tgctgcgcga 11340 
gattagtccc gcgcgcgcac acgtggcggc 11400 
ggtgaaccag gagattaact ttcaaaaaag 114 60 
gcgcgaggag gtggctatag gactgatgca 11520 
aaacccaaat agcaagccgc tcatggcgca 11580 
caacgaggca ttcagggatg cgctgctaaa 11640 
cgatttgata aacatcctgc agagcatagt 11700 
caaggtggcc gccatcaact attccatgct 11760 
ataccatacc ccttacgttc ccatagacaa 11820 
catggcgctg aaggtgctta ccttgagcga 11880 
ccacaaggcc gtgagcgtga gccggcggcg 11940 
cctgcaaagg gccctggctg gcacgggcag 12000 
cgcgggcgct gacctgcgct gggccccaag 12060 
acctgggctg gcggtggcac ccgcgcgcgc 12120 
cgaggacgat gagtacgagc cagaggacgg 12180 
atgatgcaag acgcaacgga cccggcggtg 12240 
cttaactcca cggacgactg gcgccaggtc 12300 
aatcctgacg cgttccggca gcagccgcag 12360 
gtggtcccgg cgcgcgcaaa ccccacgcac 12420 
gccgaaaaca gggccatccg gcccgacgag 12480 
cgcgtggctc gttacaacag cggcaacgtg 1254 0 
gtgcgcgagg ccgtggcgca gcgtgagcgc 12600 
gttgcactaa acgccttcct gagtacacag 12660 
tacaccaact ttgtgagcgc actgcggcta 12720 
taccagtctg ggccagacta ttttttccag 12780 
ctgagccagg ctttcaaaaa cttgcagggg 12840 
cgcgcgaccg tgtctagctt gctgacgccc 12900 
cccttcacgg acagtggcag cgtgtcccgg 12960 
taccgcgagg ccataggtca ggcgcatgtg 13020 
gtcagccgcg cgctggggca ggaggacacg 13080 
ctgaccaacc ggcggcagaa gatcccctcg 13140 
attttgcgct acgtgcagca gagcgtgagc 13200 
agcgtggcgc tggacatgac cgcgcgcaac 13260 
ccgtttatca accgcctaat ggactacttg 13320 
ttcaccaatg ccatcttgaa cccgcactgg 13380 
ttcgaggtgc ccgagggtaa cgatggattc 134 40 
tccccgcaac cgcagaccct gctagagttg 13500 
cgaaaggaaa gcttccgcag gccaagcagc 13560 
tcagatgcta gtagcccatt tccaagcttg 13620 
cgcccgcgcc tgctgggcga ggaggagtac 13680 
gaaaaaaacc tgcctccggc atttcccaac 13740 
agtagatgga agacgtacgc gcaggagcac 13800 
cgtcgtcaaa ggcacgaccg tcagcggggt 13860 
gacagcagcg tcctggattt gggagggagt 13920 
ctggggagaa tgttttaaaa aaaaaaaagc 13980 
atggcaccga gcgttggttt tcttgtattc 14040 
aggaaggtcc tcctccctcc tacgagagtg 14100 
tgggttctcc cttcgatgct cccctggacc 14160 
ctaccggggg gagaaacagc atccgttact 14220 
gtgtgtacct ggtggacaac aagtcaacgg 14280 
acagcaactt tctgaccacg gtcattcaaa 14340 
cacagaccat caatcttgac gaccggtcgc 14400 
ataccaacat gccaaatgtg aacgagttca 14460 
tggtgtcgcg cttgcctact aaggacaatc 14520 
tcacgctgcc cgagggcaac tactccgaga 14580 
tcgtggagca ctacttgaaa gtgggcagac 14640 
taaagtttga cacccgcaac ttcagactgg 14700 
ctggggtata tacaaacgaa gccttccatc 14760 
tggacttcac ccacagccgc ctgagcaact 14820 
aggagggctt taggatcacc tacgatgatc 14880 
atgtggacgc ctaccaggcg agcttgaaag 14940 
gcggcagcaa cagcagtggc agcggcgcgg 15000 
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aagagaactc caacgcggca gccgcggcaa tgcagccggt ggaggacatg aacgatcatg 15060 
ccattcgcgg cgacaccttt gccacacggg ctgaggagaa gcgcgctgag gccgaagcag 15120 
cggccgaagc tgccgccccc gctgcgcaac ccgaggtcga gaagcctcag aagaaaccgg 15180 
tgatcaaacc cctgacagag gacagcaaga aacgcagtta caacctaata agcaatgaca 15240 
gcaccttcac ccagtaccgc agctggtacc ttgcatacaa ctacggcgac cctcagaccg 15300 
gaatccgctc atggaccctg ctttgcactc ctgacgtaac ctgcggctcg gagcaggtct 15360 
actggtcgtt gccagacatg atgcaagacc ccgtgacctt ccgctccacg cgccagatca 15420 
gcaactttcc ggtggtgggc gccgagctgt tgcccgtgca ctccaagagc ttctacaacg 15480 
accaggccgt ctactcccaa ctcatccgcc agtttacctc tctgacccac gtgttcaatc 15540 
gctttcccga gaaccagatt ttggcgcgcc cgccagcccc caccatcacc accgtcagtg 15600 
aaaacgttcc tgctctcaca gatcacggga cgctaccgct gcgcaacagc atcggaggag 15660 
tccagcgagt gaccattact gacgccagac gccgcacctg cccctacgtt tacaaggccc 15720 
tgggcatagt ctcgccgcgc gtcctatcga gccgcacttt ttgagcaagc atgtccatcc 15780 
ttatatcgcc cagcaataac acaggctggg gcctgcgctt cccaagcaag atgtttggcg 15840 
gggccaagaa gcgctccgac caacacccag tgcgcgtgcg cgggcactac cgcgcgccct 15900 
ggggcgcgca caaacgcggc cgcactgggc gcaccaccgt cgatgacgcc atcgacgcgg 15960 
tggtggagga ggcgcgcaac tacacgccca cgccgccacc agtgtccaca gtggacgcgg 16020 
ccattcagac cgtggtgcgc ggagcccggc gctatgctaa aatgaagaga cggcggaggc 16080 
gcgtagcacg tcgccaccgc cgccgacccg gcactgccgc ccaacgcgcg gcggcggccc 16140 
tgcttaaccg cgcacgtcgc accggccgac gggcggccat gcgggccgct cgaaggctgg 16200 
ccgcgggtat tgtcactgtg ccccccaggt ccaggcgacg agcggccgcc gcagcagccg 16260 
cggccattag tgctatgact cagggtcgca ggggcaacgt gtattgggtg cgcgactcgg 16320 
ttagcggcct gcgcgtgccc gtgcgcaccc gccccccgcg caactagatt gcaagaaaaa 16380 
actacttaga ctcgtactgt tgtatgtatc cagcggcggc ggcgcgcaac gaagctatgt 16440 
ccaagcgcaa aatcaaagaa gagatgctcc aggtcatcgc gccggagatc tatggccccc 16500 
cgaagaagga agagcaggat tacaagcccc gaaagctaaa gcgggtcaaa aagaaaaaga 16560 
aagatgatga tgatgaactt gacgacgagg tggaactgct gcacgctacc gcgcccaggc 16620 
gacgggtaca gtggaaaggt cgacgcgtaa aacgtgtttt gcgacccggc accaccgtag 16680 
tctttacgcc cggtgagcgc tccacccgca cctacaagcg cgtgtatgat gaggtgtacg 16740 
gcgacgagga cctgcttgag caggccaacg agcgcctcgg ggagtttgcc tacggaaagc 16800 
ggcataagga catgctggcg ttgccgctgg acgagggcaa cccaacacct agcctaaagc 16860 
ccgtaacact gcagcaggtg ctgcccgcgc ttgcaccgtc cgaagaaaag cgcggcctaa 16920 
agcgcgagtc tggtgacttg gcacccaccg tgcagctgat ggtacccaag cgccagcgac 16980 
tggaagatgt cttggaaaaa atgaccgtgg aacctgggct ggagcccgag gtccgcgtgc 1704 0 
ggccaatcaa gcaggtggcg ccgggactgg gcgtgcagac cgtggacgtt cagataccca 17100 
ctaccagtag caccagtatt gccaccgcca cagagggcat ggagacacaa acgtccccgg 17160 
ttgcctcagc ggtggcggat gccgcggtgc aggcggtcgc tgcggccgcg tccaagacct 17220 
ctacggaggt gcaaacggac ccgtggatgt ttcgcgtttc agccccccgg cgcccgcgcg 17280 
gttcgaggaa gtacggcgcc gccagcgcgc tactgcccga atatgcccta catccttcca 17340 
ttgcgcctac ccccggctat cgtggctaca cctaccgccc cagaagacga gcaactaccc 17400 
gacgccgaac caccactgga acccgccgcc gccgtcgccg tcgccagccc gtgctggccc 17460 
cgatttccgt gcgcagggtg gctcgcgaag gaggcaggac cctggtgctg ccaacagcgc 17520 
gctaccaccc cagcatcgtt taaaagccgg tctttgtggt tcttgcagat atggccctca 17580 
cctgccgcct ccgtttcccg gtgccgggat tccgaggaag aatgcaccgt aggaggggca 17640 
tggccggcca cggcctgacg ggcggcatgc gtcgtgcgca ccaccggcgg cggcgcgcgt 17700 
cgcaccgtcg catgcgcggc ggtatcctgc ccctccttat tccactgatc gccgcggcga 17760 
ttggcgccgt gcccggaatt gcatccgtgg ccttgcaggc gcagagacac tgattaaaaa 17B20 
caagttgcat gtggaaaaat caaaataaaa agtctggact ctcacgctcg cttggtcctg 17880 
taactatttt gtagaatgga agacatcaac tttgcgtctc tggccccgcg acacggctcg 17940 
cgcccgttca tgggaaactg gcaagatatc ggcaccagca atatgagcgg tggcgccttc 18000 
agctggggct cgctgtggag cggcattaaa aatttcggtt ccaccgttaa gaactatggc 18060 
agcaaggcct ggaacagcag cacaggccag atgctgaggg ataagttgaa agagcaaaat 18120 
ttccaacaaa aggtggtaga tggcctggcc tctggcatta gcggggtggt ggacctggcc 18180 
aaccaggcag tgcaaaataa gattaacagt aagcttgatc cccgccctcc cgtagaggag 1824 0 
cctccaccgg ccgtggagac agtgtctcca gaggggcgtg gcgaaaagcg tccgcgcccc 18300 
gacagggaag aaactctggt gacgcaaata gacgagcctc cctcgtacga ggaggcacta 18360 
aagcaaggcc tgcccaccac ccgtcccatc gcgcccatgg ctaccggagt gctgggccag 18420 
cacacacccg taacgctgga cctgcctccc cccgccgaca cccagcagaa acctgtgctg 18480 
ccaggcccga ccgccgttgt tgtaacccgt cctagccgcg cgtccctgcg ccgcgccgcc 18540 
agcggtccgc gatcgttgcg gcccgtagcc agtggcaact ggcaaagcac actgaacagc 18600 
atcgtgggtc tgggggtgca atccctgaag cgccgacgat gcttctgaat agctaacgtg 18660 
tcgtatgtgt gtcatgtatg cgtccatgtc gccgccagag gagctgctga gccgccgcgc 18720 
gcccgctttc caagatggct accccttcga tgatgccgca gtggtcttac atgcacatct 18780 
cgggccagga cgcctcggag tacctgagcc ccgggctggt gcagtttgcc cgcgccaccg 18840 
agacgtactt cagcctgaat aacaagttta gaaaccccac ggtggcgcct acgcacgacg 18900 
tgaccacaga ccggtcccag cgtttgacgc tgcggttcat ccctgtggac cgtgaggata 18960 
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ctgcgtactc gtacaaggcg cggttcaccc 
tggcttccac gtactttgac atccgcggcg 
actctggcac tgcctacaac gccctggctc 
atgaagctgc tactgctctt gaaataaacc 
aagtagacga gcaagctgag cagcaaaaaa 
gtataaatat tacaaaggag ggtattcaaa 
ccgataaaac atttcaacct gaacctcaaa 
ttaatcatgc agctgggaga gtccttaaaa 
catatgcaaa acccacaaat gaaaatggag 
gaaagctaga aagtcaagtg gaaatgcaat 
atggtgataa cttgactcct aaagtggtat 
cagacactca tatttcttac atgcccacta 
gccaacaatc tatgcccaac aggcctaatt 
taatgtatta caacagcacg ggtaatatgg 
atgctgttgt agatttgcaa gacagaaaca 
ccattggtga tagaaccagg tacttttcta 
cagatgttag aattattgaa aatcatggaa 
cactgggagg tgtgattaat acagagactc 
aaaatggatg ggaaaaagat gctacagaat 
ataattttgc catggaaatc aatctaaatg 
acatagcgct gtatttgccc gacaagctaa 
ataacccaaa cacctacgac tacatgaaca 
gctacattaa ccttggagca cgctggtccc 
accaccaccg caatgctggc ctgcgctacc 
tgcccttcca catccaggtg cctcagaagt 
cgggctcata cacctacgag tggaacttca 
ccctaggaaa tgacctaagg gttgacggag 
acgccacctt cttccccatg gcccacaaca 
acgacaccaa cgaccagtcc tttaacgact 
tacccgccaa cgctaccaac gtgcccatat 
gcggctgggc cttcacgcgc cttaagacta 

acacctttaa gaaggtggcc attacctttg 
gcctgcttac ccccaacgag tttgaaatta 
ttgcccagtg taacatgacc aaagactggt 
ttggctacca gggcttctat atcccagaga 
gaaacttcca gcccatgagc cgtcaggtgg 
aggtgggcat cctacaccaa cacaacaact 
tgcgcgaagg acaggcctac cctgctaact 
ttgacagcat tacccagaaa aagtttcttt 
ccagtaactt tatgtccatg ggcgcactca 
actccgccca cgcgctagac atgacttttg 
tttatgtttt gtttgaagtc tttgacgtgg 
tcgaaaccgt gtacctgcgc acgcccttct 
agcaacatca acaacagctg ccgccatggg 
caaagatctt ggttgtgggc catatttttt 
tgtttctcca cacaagctcg cctgcgccat 
cgtacactgg atggcctttg cctggaaccc 
ctttggcttt tctgaccagc gactcaagca 
gcgccgtagc gccattgctt cttcccccga 
aagcgtacag gggcccaact cggccgcctg 
ctttgccaac tggccccaaa ctcccatgga 
ggtacccaac tccatgctca acagtcccca 
acagctctac agcttcctgg agcgccactc 
taggagcgcc acttcttttt gtcacttgaa 
tttcaataaa ggcaaatgct tttatttgta 
tgccgtctgc gccgtttaaa aatcaaaggg 
cagggacacg ttgcgatact ggtgtttagt 
cggcagctcg gtgaagtttt cactccacag 
gtcgggcgcc gatatcttga agtcgcagtt 
atacacaggg ttgcagcact ggaacactat 
gctcttgtcg gagatcagat ccgcgtccag 
caactttggt agctgccttc ccaaaaaggg 
ccgtagtggc atcaaaaggt gaccgtgccc 
aaaagccttg atctgcttaa aagccacctg 
gcaagacttg ccggaaaact gattggccgg 



tagctgtggg tgataaccgt gtgctggaca 19020 
tgctggacag gggccctact tttaagccct 19080 
ccaagggtgc cccaaatcct tgcgaatggg 19140 
tagaagaaga ggacgatgac aacgaagacg 19200 
ctcacgtatt tgggcaggcg ccttattctg 19260 
taggtgtcga aggtcaaaca cctaaatatg 19320 
taggagaatc tcagtggtac gaaactgaaa 19380 
agactacccc aatgaaacca tgttacggtt 19440 
ggcaaggcat tcttgtaaag caacaaaatg 19500 
ttttctcaac tactgaggcg accgcaggca 19560 
tgtacagtga agatgtagat atagaaaccc 19620 
ttaaggaagg taactcacga gaactaatgg 19680 
acattgcttt tagggacaat tttattggtc 19740 
gtgttctggc gggccaagca tcgcagttga 19800 
cagagctttc ataccagctt ttgcttgatt 19860 
tgtggaatca ggctgttgac agctatgatc 19920 
ctgaagatga acttccaaat tactgctttc 19980 
ttaccaaggt aaaacctaaa acaggtcagg 20040 
tttcagataa aaatgaaata agagttggaa 20100 
ccaacctgtg gagaaatttc ctgtactcca 20160 
agtacagtcc ttccaacgta aaaatttctg 20220 
agcgagtggt ggctcccggg ttagtggact 20280 
ttgactatat ggacaacgtc aacccattta 20340 
gctcaatgtt gctgggcaat ggtcgctatg 20400 
tctttgccat taaaaacctc cttctcctgc 20460 
ggaaggatgt taacatggtt ctgcagagct 20520 
ccagcattaa gtttgatagc atttgccttt 20580 
ccgcctccac gcttgaggcc atgcttagaa 20540 
atctctccgc cgccaacatg ctctacccta 20700 
ccatcccctc ccgcaactgg gcggctttcc 20760 
aggaaacccc atcactgggc tcgggctacg 20820 
cctacctaga tggaaccttt tacctcaacc 20880 
actcttctgt cagctggcct ggcaatgacc 20940 
agcgctcagt tgacggggag ggttacaacg 21000 
tcctggtaca aatgctagct aactacaaca 21060 
gctacaagga ccgcatgtac tccttcttta 21120 
tggatgatac taaatacaag gactaccaac 21180 
ctggatttgt tggctacctt gcccccacca 21240 
tcccctatcc gcttataggc aagaccgcag 21300 
gcgatcgcac cctttggcgc atcccattct 21360 
cagacctggg ccaaaacctt ctctacgcca 21420 
aggtggatcc catggacgag cccacccttc 21480 
tccgtgtgca ccggccgcac cgcggcgtca 21540 
cggccggcaa cgccacaaca taaagaagca 21600 
ctccagtgag caggaactga aagccattgt 21660 
gggcacctat gacaagcgct ttccaggctt 21720 
agtcaatacg gccggtcgcg agactggggg 21780 
gcactcaaaa acatgctacc tctttgagcc 21840 
ggtttaccag tttgagtacg agtcactcct 21900 
ccgctgtata acgctggaaa agtccaccca 21960 
tggactattc tgctgcatgt ttctccacgc 22020 
tcacaacccc accatgaacc ttattaccgg 22080 
ggtacagccc accctgcgtc gcaaccagga 22140 
gccctacttc cgcagccaca gtgcgcagat 22200 
aaacatgtaa aaataatgta ctagagacac 22260 
cactctcggg tgattattta cccccaccct 22320 
gttctgccgc gcatcgctat gcgccactgg 22380 
gctccactta aactcaggca caaccatccg 22440 
gctgcgcacc atcaccaacg cgtttagcag 22500 
ggggcctccg ccctgcgcgc gcgagttgcg 22560 
cagcgccggg tggtgcacgc tggccagcac 22620 
gtcctccgcg ttgctcaggg cgaacggagt 22680 
cgcgtgccca ggctttgagt tgcactcgca 22740 
ggtctgggcg ttaggataca gcgcctgcat 22800 
agcctttgcg ccttcagaga agaacatgcc 22860 
acaggccgcg tcgtgcacgc agcaccttgc 22920 
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gtcggtgttg gagatctgca ccacatttcg gccccaccgg ttcttcacga tcttggcctt 22980 
gctagactgc tccttcagcg cgcgctgccc gttttcgctc gtcacatcca tttcaatcac 23040 
gtgctcctta tttatcataa tgcttccgtg tagacactta agctcgcctt cgatctcagc 23100 
gcagcggtgc agccacaacg cgcagcccgt gggctcgtga tgcttgtagg tcacctctgc 23160 
aaacgactgc aggtacgcct gcaggaatcg ccccatcatc gtcacaaagg tcttgttgct 23220 
ggtgaaggtc agctgcaacc cgcggtgctc ctcgttcagc caggtcttgc atacggccgc 23280 
cagagcttcc acttggtcag gcagtagttt gaagttcgcc tttagatcgt tatccacgtg 23340 
gtacttgtcc atcagcgcgc gcgcagcctc catgcccttc tcccacgcag acacgatcgg 23400 
cacactcagc gggttcatca ccgtaatttc actttccgct tcgctgggct cttcctcttc 23460 
ctcttgcgtc cgcataccac gcgccactgg gtcgtcttca ttcagccgcc gcactgtgcg 23520 
cttacctcct ttgccatgct tgattagcac cggtgggttg ctgaaaccca ccatttgtag 23580 
cgccacatct tctctttctt cctcgctgtc cacgattacc tctggtgatg gcgggcgctc 23640 
gggcttggga gaagggcgct tctttttctt cttgggcgca atggccaaat ccgccgccga 23700 
ggtcgatggc cgcgggctgg gtgtgcgcgg caccagcgcg tcttgtgatg agtcttcctc 23760 
gtcctcggac tcgatacgcc gcctcatccg cttttttggg ggcgcccggg gaggcggcgg 23820 
cgacggggac ggggacgaca cgtcctccat ggttggggga cgtcgcgccg caccgcgtcc 23880 
gcgctcgggg gtggtttcgc gctgctcctc ttcccgactg gccatttcct tctcctatag 23940 
gcagaaaaag atcatggagt cagtcgagaa gaaggacagc ctaaccgccc cctctgagtt 24 000 
cgccaccacc gcctccaccg atgccgccaa cgcgcctacc accttccccg tcgaggcacc 24060 
cccgcttgag gaggaggaag tgattatcga gcaggaccca ggttttgtaa gcgaagacga 24120 
cgaggaccgc tcagtaccaa cagaggataa aaagcaagac caggacaacg cagaggcaaa 24180 
cgaggaacaa gtcgggcggg gggacgaaag gcatggcgac tacctagatg tgggagacga 24240 
cgtgctgttg aagcatctgc agcgccagtg cgccattatc tgcgacgcgt tgcaagagcg 24300 
cagcgatgtg cccctcgcca tagcggatgt cagccttgcc tacgaacgcc acctattctc 24360 
accgcgcgta ccccccaaac gccaagaaaa cggcacatgc gagcccaacc cgcgcctcaa 24 420 
cttctacccc gtatttgccg tgccagaggt gcttgccacc tatcacatct ttttccaaaa 24480 
ctgcaagata cccctatcct gccgtgccaa ccgcagccga gcggacaagc agctggcctt 24540 
gcggcagggc gctgtcatac ctgatatcgc ctcgctcaac gaagtgccaa aaatctttga 24600 
gggtcttgga cgcgacgaga agcgcgcggc aaacgctctg caacaggaaa acagcgaaaa 24660 
tgaaagtcac tctggagtgt tggtggaact cgagggtgac aacgcgcgcc tagccgtact 24720 
aaaacgcagc atcgaggtca cccactttgc ctacccggca cttaacctac cccccaaggt 24 780 
catgagcaca gtcatgagtg agctgatcgt gcgccgtgcg cagcccctgg agagggatgc 24840 
aaatttgcaa gaacaaacag aggagggcct acccgcagtt ggcgacgagc agctagcgcg 24 900 
ctggcttcaa acgcgcgagc ctgccgactt ggaggagcga cgcaaactaa tgatggccgc 24 960 
agtgctcgtt accgtggagc ttgagtgcat gcagcggttc tttgctgacc cggagatgca 25020 
gcgcaagcta gaggaaacat tgcactacac ctttcgacag ggctacgtac gccaggcctg 25080 
caagatctcc aacgtggagc tctgcaacct ggtctcctac cttggaattt tgcacgaaaa 25140 
ccgccttggg caaaacgtgc ttcattccac gctcaagggc gaggcgcgcc gcgactacgt 25200 
ccgcgactgc gtttacttat ttctatgcta cacctggcag acggccatgg gcgtttggca 25260 
gcagtgcttg gaggagtgca acctcaagga gctgcagaaa ctgctaaagc aaaacttgaa 25320 
ggacctatgg acggccttca acgagcgctc cgtggccgcg cacctggcgg acatcatttt 25380 
ccccgaacgc ctgcttaaaa ccctgcaaca gggtctgcca gacttcacca gtcaaagcat 25440 
gttgcagaac tttaggaact ttatcctaga gcgctcagga atcttgcccg ccacctgctg 25500 
tgcacttcct agcgactttg tgcccattaa gtaccgcgaa tgccctccgc cgctttgggg 25560 
ccactgctac cttctgcagc tagccaacta ccttgcctac cactctgaca taatggaaga 25620 
cgtgagcggt gacggtctac tggagtgtca ctgtcgctgc aacctatgca ccccgcaccg 25680 
ctccctggtt tgcaattcgc agctgcttaa cgaaagtcaa attatcggta cctttgagct 25740 
gcagggtccc tcgcctgacg aaaagtccgc ggctccgggg ttgaaactca ctccggggct 25800 
gtggacgtcg gcttaccttc gcaaatttgt acctgaggac taccacgccc acgagattag 25860 
gttctacgaa gaccaatccc gcccgccaaa tgcggagctt accgcctgcg tcattaccca 25920 
gggccacatt cttggccaat tgcaagccat caacaaagcc cgccaagagt ttctgctacg 25980 
aaagggacgg ggggtttact tggaccccca gtccggcgag gagctcaacc caatcccccc 26040 
gccgccgcag ccctatcagc agcagccgcg ggcccttgct tcccaggatg gcacccaaaa 26100 
agaagctgca gctgccgccg ccacccacgg acgaggagga atactgggac agtcaggcag 26160 
aggaggtttt ggacgaggag gaggaggaca tgatggaaga ctgggagagc ctagacgagg 26220 
aagcttccga ggtcgaagag gtgtcagacg aaacaccgtc accctcggtc gcattcccct 26280 
cgccggcgcc ccagaaatcg gcaaccggtt ccagcatggc tacaacctcc gctcctcagg 26340 
cgccgccggc actgcccgtt cgccgaccca accgtagatg ggacaccact ggaaccaggg 26400 
ccggtaagtc caagcagccg ccgccgttag cccaagagca acaacagcgc caaggctacc 26460 
gctcatggcg cgggcacaag aacgccatag ttgcttgctt gcaagactgt gggggcaaca 26520 
tctccttcgc ccgccgcttt cttctctacc atcacggcgt ggccttcccc cgtaacatcc 26580 
tgcattacta ccgtcatctc tacagcccat actgcaccgg cggcagcggc agcggcagca 26640 
acagcagcgg ccacacagaa gcaaaggcga ccggatagca agactctgac aaagcccaag 26700 
aaatccacag cggcggcagc agcaggagga ggagcgctgc gtctggcgcc caacgaaccc 26760 
gtatcgaccc gcgagcttag aaacaggatt tttcccactc tgtatgctat atttcaacag 26820 
agcaggggcc aagaacaaga gctgaaaata aaaaacaggt ctctgcgatc cctcacccgc 26880 
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agctgcctgt atcacaaaag cgaagatcag cttcggcgca cgctggaaga cgcggaggct 26940 
ctcttcagta aatactgcgc gctgactctt aaggactagt ttcgcgccct ttctcaaatt 27000 
taagcgcgaa aactacgtca tctccagcgg ccacacccgg cgccagcacc tgtcgtcagc 27060 
gccattatga gcaaggaaat tcccacgccc tacatgtgga gttaccagcc acaaatggga 27120 
cttgcggctg gagctgccca agactactca acccgaataa actacatgag cgcgggaccc 27180 
cacatgatat cccgggtcaa cggaatccgc gcccaccgaa accgaattct cttggaacag 27240 
gcggctatta ccaccacacc tcgtaataac cttaatcccc gtagttggcc cgctgccctg 27300 
gtgtaccagg aaagtcccgc tcccaccact gtggtacttc ccagagacgc ccaggccgaa 27360 
gttcagatga ctaactcagg ggcgcagctt gcgggcggct ttcgtcacag ggtgcggtcg 27420 
cccgggcagg gtataactca cctgacaatc agagggcgag gtattcagct caacgacgag 27480 
tcggtgagct cctcgcttgg tctccgtccg gacgggacat ttcagatcgg cggcgccggc 27540 
cgtccttcat tcacgcctcg tcaggcaatc ctaactctgc agacctcgtc ctctgagccg 27600 
cgctctggag gcattggaac tctgcaattt attgaggagt ttgtgccatc ggtctacttt 27660 
aaccccttct cgggacctcc cggccactat ccggatcaat ttattcctaa ctttgacgcg 27720 
gtaaaggact cggcggacgg ctacgactga atgttaagtg gagaggcaga gcaactgcgc 27780 
ctgaaacacc tggtccactg tcgccgccac aagtgctttg cccgcgactc cggtgagttt 27840 
tgctactttg aattgcccga ggatcatatc gagggcccgg cgcacggcgt ccggcttacc 27900 
gcccagggag agcttgcccg tagcctgatt cgggagttta cccagcgccc cctgctagtt 27960 
gagcgggaca ggggaccctg tgttctcact gtgatttgca actgtcctaa ccttggatta 28020 
catcaagatc tttgttgcca tctctgtgct gagtataata aatacagaaa ttaaaatata 28080 
ctggggctcc tatcgccatc ctgtaaacgc caccgtcttc acccgcccaa gcaaaccaag 28140 
gcgaacctta cctggtactt ttaacatctc tccctctgtg atttacaaca gtttcaaccc 28200 
agacggagtg agtctacgag agaacctctc cgagctcagc tactccatca gaaaaaacac 28260 
caccctcctt acctgccggg aacgtacgag tgcgtcaccg gccgctgcac cacacctacc 28320 
gcctgaccgt aaaccagact ttttccggac agacctcaat aactctgttt accagaacag 28380 
gaggtgagct tagaaaaccc ttagggtatt aggccaaagg cgcagctact gtggggttta 28440 
tgaacaattc aagcaactct acgggctatt ctaattcagg tttctctagg gttggggtta 28500 
ttctctgtct tgtgattctc tttattctta tactaacgct tctctgccta aggctcgccg 28560 
cctgctgtgt gcacatttgc atttattgtc agctttttaa acgctggggt cgccacccaa 28620 
gatgattagg tacataatcc taggtttact cacccttgcg tcagcccacg gtacttaatt 28680 
aacccaaaag gtggatttta aggagccagc ctgtaatgtt acattcgcag ctgaagctaa 28740 
tgagtgcacc actcttataa aatgcaccac agaacatgaa aagctgctta ttcgccacaa 28800 
aaacaaaatt ggcaagtatg ctgtttatgc tatttggcag ccaggtgaca ctacagagta 28860 
taatgttaca gttttccagg gtaaaagtca taaaactttt atgtatactt ttccatttta 28920 
tgaaatgtgc gacattacca tgtacatgag caaacagtat aagttgtggc ccccacaaaa 28980 
ttgtgtggaa aacactggca ctttctgctg cactgctatg ctaattacag tgctcgcttt 29040 
ggtctgtacc ctactctata ttaaatacaa aagcagacgc agctttattg aggaaaagaa 29100 
aatgccttaa tttactaagt tacaaagcta atgtcaccac taactgcttt actcgctgct 29160 
tgcaaaacaa attcaaaaag ttagcattat aattagaata ggatttaaac cccccggtca 29220 
tttcctgctc aataccattc ccctgaacaa ttgactctat gtgggatatg ctccagcgct 29280 
acaaccttga agtcaggctt cctggatgtc agcatctgac tttggccagc acctgtcccg 29340 
cggatttgtt ccagtccaac tacagcgacc caccctaaca gagatgacca acacaaccaa 29400 
cgcggccgcc gctaccggac ttacatctac cacaaataca ccccaagttt ctgcctttgt 29460 
caataactgg gataacttgg gcatgtggtg gttctccata gcgcttatgt ttgtatgcct 29520 
tattattatg tggctcatct gctgcctaaa gcgcaaacgc gcccgaccac ccatctatag 2 9580 
tcccatcatt gtgctacacc caaacaatga tggaatccat agattggacg gactgaaaca 29640 
catgttcttt tctcttacag tatgattaaa tgagacatga ttcctcgagt ttttatatta 29700 
ctgacccttg ttgcgctttt ttgtgcgtgc tccacattgg ctgcggtttc tcacatcgaa 29760 
gtagactgca ttccagcctt cacagtctat ttgctttacg gatttgtcac cctcacgctc 29820 
atctgcagcc tcatcactgt ggtcatcgcc tttatccagt gcattgactg ggtctgtgtg 29880 
cgctttgcat atctcagctg ctgccatgtt gtgttgctac catgttgttt tcatgtgttg 29940 
ctgccatgct cttgtcgcct tagatctctc tttatgtagt gttgtggtgt ctctcttgtc 30000 
gtgatgtgtg ttttgtccta tatattttaa tttttaatcc aaacccctgt ccccgcagag 30060 
gcctttgcgt tctggtaggc cgtcattgaa aactgactta actcgttaaa ttaaaaaaat 30120 
gtaaaaaata atggttgaga ctcagcccaa catcggcaga tgaggtggat tgagactcag 30180 
cccaacatcg gcagatgagg tggattgaga ctcaacccca acattggcag atgaggtgaa 30240 
ttagatgagg tggattgaga ctcatgaggg tggtatgagg gcccgacgtc cacaggtggg 30300 
agttgtgctt tacagtccaa cgtgcaggac gcttggcatt tgccagagaa caccaagatt 30360 
ggcaaattcg caactggcgc cctgtgctct tcacagacgg aaaaatgacc aaaatctgat 30420 
tatttttgta aaacggaaac cgaatgtccg acaaagttca tttgatgact tcccggtagg 30480 
tctgccctgc cgctgggccg acgccgtccg ggaattttac aaacgatttc ggacgtctag 30540 
cattcactca ccttgtcaag gacctgagga tctctgcacc cttattaaga ccctgtgcgg 30600 
tctcaaagat cttattccct ttaactaata aaaaaaaata ataaagcatc acttacttaa 30660 
aatcagttag caaatttctg tccagtttat tcagcagcac ctccttgccc tcctcccagc 30720 
tctggtattg cagcttcctc ctggctgcaa actttctcca caatctaaat ggaatgtcag 30780 
tttcctcctg ttcctgtcca tccgcaccca ctatcttcat gttgttgcag atgaagcgcg 30840 
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caagaccgtc tgaagatacc ttcaaccccg tgtatccata tgacacggaa accggtcctc 30900 
caactgtgcc ttttcttact cctccctttg tatcccccaa tgggtttcaa gagagtcccc 30960 
ctggggtact ctctttgcgc ctatccgaac ctctagttac ctccaatggc atgcttgcgc 31020 
tcaaaatggg caacggcctc tctctggacg aggccggcaa ccttacctcc caaaatgtaa 31080 
ccactgtgag cccacctctc aaaaaaacca agtcaaacat aaacctggaa atatctgcac 31140 
ccctcacagt tacctcagaa gccctaactg tggctgccgc cgcacctcta atggtcgcgg 31200 
gcaacacact caccatgcaa tcacaggccc cgctaaccgt gcacgactcc aaacttagca 31260 
ttgccaccca aggacccctc acagtgtcag aaggaaagct agccctgcaa acatcaggcc 31320 
ccctcaccac caccgatagc agtaccctta ctatcactgc ctcaccccct ctaactactg 31380 
ccactggtag cttgggcatt gacttgaaag agcccattta tacacaaaat ggaaaactag 31440 
gactaaagta cggggctcct ttgcatgtaa cagacgacct aaacactttg accgtagcaa 31500 
ctggtccagg tgtgactatt aataatactt ccttgcaaac taaagttact ggagccttgg 31560 
gttttgattc acaaggcaat atgcaactta atgtagcagg aggactaagg attgattctc 31620 
aaaacagacg ccttatactt gatgttagtt atccgtttga tgctcaaaac caactaaatc 31680 
taagactagg acagggccct ctttttataa actcagccca caacttggat attaactaca 31740 
acaaaggcct ttacttgttt acagcttcaa acaattccaa aaagcttgag gttaacctaa 31800 
gcactgccaa ggggttgatg tttgacgcta cagccatagc cattaatgca ggagatgggc 31860 
ttgaatttgg ttcacctaat gcaccaaaca caaatcccct caaaacaaaa attggccatg 31920 
gcctagaatt tgattcaaac aaggctatgg ttcctaaact aggaactggc cttagttttg 319B0 
acagcacagg tgccattaca gtaggaaaca aaaataatga taagctaact ttgtggacca 32040 
caccagctcc atctcctaac tgtagactaa atgcagagaa agatgctaaa ctcactttgg 32100 
tcttaacaaa atgtggcagt caaatacttg ctacagtttc agttttggct gttaaaggca 32160 
gtttggctcc aatatctgga acagttcaaa gtgctcatct tattataaga tttgacgaaa 32220 
atggagtgct actaaacaat tccttcctgg acccagaata ttggaacttt agaaatggag 32280 
atcttactga aggcacagcc tatacaaacg ctgttggatt tatgcctaac ctatcagctt 3234 0 
atccaaaatc tcacggtaaa actgccaaaa gtaacattgt cagtcaagtt tacttaaacg 32400 
gagacaaaac taaacctgta acactaacca ttacactaaa cggtacacag gaaacaggag 324 60 
acacaactcc aagtgcatac tctatgtcat tttcatggga ctggtctggc cacaactaca 32520 
ttaatgaaat atttgccaca tcctcttaca ctttttcata cattgcccaa gaataaagaa 325B0 
tcgtttgtgt tatgtttcaa cgtgtttatt tttcaattgc agaaaatttc aagtcatttt 32640 
tcattcagta gtatagcccc accaccacat agcttataca gatcaccgta ccttaatcaa 32700 
actcacagaa ccctagtatt caacctgcca cctccctccc aacacacaga gtacacagtc 32760 
ctttctcccc ggctggcctt aaaaagcatc atatcatggg taacagacat attcttaggt 32820 
gttatattcc acacggtttc ctgtcgagcc aaacgctcat cagtgatatt aataaactcc 32880 
ccgggcagct cacttaagtt catgtcgctg tccagctgct gagccacagg ctgctgtcca 32940 
acttgcggtt gcttaacggg cggcgaagga gaagtccacg cctacatggg ggtagagtca 33000 
taatcgtgca tcaggatagg gcggtggtgc tgcagcagcg cgcgaataaa ctgctgccgc 33060 
cgccgctccg tcctgcagga atacaacatg gcagtggtct cctcagcgat gattcgcacc 33120 
gcccgcagca taaggcgcct tgtcctccgg gcacagcagc gcaccctgat ctcacttaaa 33180 
tcagcacagt aactgcagca cagcaccaca atattgttca aaatcccaca gtgcaaggcg 33240 
ctgtatccaa agctcatggc ggggaccaca gaacccacgt ggccatcata ccacaagcgc 33300 
aggtagatta agtggcgacc cctcataaac acgctggaca taaacattac ctcttttggc 33360 
atgttgtaat tcaccacctc ccggtaccat ataaacctct gattaaacat ggcgccatcc 33420 
accaccatcc taaaccagct ggccaaaacc tgcccgccgg ctatacactg cagggaaccg 334 80 
ggactggaac aatgacagtg gagagcccag gactcgtaac catggatcat catgctcgtc 3354 0 
atgatatcaa tgttggcaca acacaggcac acgtgcatac acttcctcag gattacaagc 3360O 
tcctcccgcg ttagaaccat atcccaggga acaacccatt cctgaatcag cgtaaatccc 33660 
acactgcagg gaagacctcg cacgtaactc acgttgtgca ttgtcaaagt gttacattcg 33720 
ggcagcagcg gatgatcctc cagtatggta gcgcgggttt ctgtctcaaa aggaggtaga 33780 
cgatccctac tgtacggagt gcgccgagac aaccgagatc gtgttggtcg tagtgtcatg 3384 0 
ccaaatggaa cgccggacgt agtcatattt cctgaagcaa aaccaggtgc gggcgtgaca 33900 
aacagatctg cgtctccggt ctcgccgctt agatcgctct gtgtagtagt tgtagtatat 33960 
ccactctctc aaagcatcca ggcgccccct ggcttcgggt tctatgtaaa ctccttcatg 34020 
cgccgctgcc ctgataacat ccaccaccgc agaataagcc acacccagcc aacctacaca 34080 
ttcgttctgc gagtcacaca cgggaggagc gggaagagct ggaagaacca tgtttttttt 34140 
tttattccaa aagattatcc aaaacctcaa aatgaagatc tattaagtga acgcgctccc 34200 
ctccggtggc gtggtcaaac tctacagcca aagaacagat aatggcattt gtaagatgtt 34260 
gcacaatggc ttccaaaagg caaacggccc tcacgtccaa gtggacgtaa aggctaaacc 34320 
cttcagggtg aatctcctct ataaacattc cagcaccttc aaccatgccc aaataattct 34380 
catctcgcca ccttctcaat atatctctaa gcaaatcccg aatattaagt ccggccattg 34440 
taaaaatctg ctccagagcg ccctccacct tcagcctcaa gcagcgaatc atgattgcaa 34500 
aaattcaggt tcctcacaga cctgtataag attcaaaagc ggaacattaa caaaaatacc 34560 
gcgatcccgt aggtcccttc gcagggccag ctgaacataa tcgtgcaggt ctgcacggac 34 620 
cagcgcggcc acttccccgc caggaacctt gacaaaagaa cccacactga ttatgacacg 34680 
catactcgga gctatgctaa ccagcgtagc cccgatgtaa gctttgttgc atgggcggcg 34740 
atataaaatg caaggtgctg ctcaaaaaat caggcaaagc ctcgcgcaaa aaagaaagca 34800 
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catcgtagtc atgctcatgc agataaaggc 

acaccatttt tctctcaaac atgtctgcgg 

aaaaaacatt taaacattag aagcctgtct 

aagacggact acggccatgc cggcgtgacc 

caccaccgac agctcctcgg tcatgtccgg 

aggttgattc .atcggtcagt gctaaaaagc 

gcaggcgtag agacaacatt acagccccca 

aaaacacata aacacctgaa aaaccctcct 

gaacaacata cagcgcttca cagcggcagc 

aacctattaa aaaaacacca ctcgacacgg 

gggccaagtg cagagcgagt atatatagga 

aaaaaacacc cagaaaaccg cacgcgaacc 

caacttcctc aaatcgtcac ttccgttttc 

actacaattc ccaacacata caagttactc 

cccacgcccc gcgccacgtc acaaactcca 

aaataaggta tattattgat gatg 
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aggtaagctc cggaaccacc acagaaaaag 34 860 

gtttctgcat aaacacaaaa taaaataaca 34920 

tacaacagga aaaacaaccc ttataagcat 34980 

gtaaaaaaac tggtcaccgt gattaaaaag 35040 

agtcataatg taagactcgg taaacacatc 35100 

gaccgaaata gcccggggga atacataccc 35160 

taggaggtat aacaaaatta ataggagaga 35220 

gcctaggcaa aatagcaccc tcccgctcca 35280 

ctaacagtca gccttaccag taaaaaagaa 35340 

caccagctca atcagtcaca gtgtaaaaaa 35400 

ctaaaaaatg acgtaacggt taaagtccac 35460 

tacgcccaga aacgaaagcc aaaaaaccca 35520 

ccacgttacg taacttccca ttttaagaaa 35580 

cgccctaaaa cctacgtcac ccgccccgtt 35640 

ccccctcatt atcatattgg cttcaatcca 35700 
35724 



<400> 14 

catcatcaat aatatacctt attttggatt 

ttgtgacgtg gcgcggggcg tgggaacggg 

gatgttgcaa gtgtggcgga acacatgtaa 

gtgtgcgccg gtgtacacag gaagtgacaa 

taaatttggg cgtaaccgag taagatttgg 

agtgaaatct gaataatttt gtgttactca 

gactttgacc gtttacgtgg agactcgccc 

cgggtcaaag ttggcgtttt attattatag 

tgagttcctc aagaggccac tcttgagtgc 

tccgacaccg ggactgaaaa tgagacatga 

ccattttgaa ccacctaccc ttcacgaact 

tcccaacgag gaggcggttt cgcagatttt 

agggattgac ttactcactt ttccgccggc 

ccggcagccc gagcagccgg agcagagagc 

tccacccagt gacgacgagg atgaagaggg 

ccccgggcac ggttgcaggt cttgtcatta 

tatgtgttcg ctttgctata tgaggacctg 

atgggcagtg ggtgatagag tggtgggttt 

gttttgtggt ttaaagaatt ttgtattgtg 

gagcctgagc ccgagccaga accggagcct 

cctgctatcc tgagacgccc gacatcacct 

agctgtgact ccggtccttc taacacacct 

cccattaaac cagttgccgt gagagttggt 

gacttgctta acgagcctgg gcaacctttg 

ggtgtaaacc tgtgattgcg tgtgtggtta 

agtttaataa agggtgagat aatgtttaac 

aaagggtata taatgcgccg tgggctaatc 

gagtgtttgg aagatttttc tgctgtgcgt 

tcttggtttt ggaggtttct gtggggctca 

gaggattaca agtgggaatt tgaagagctt 

ttgaatctgg gtcaccaggc gcttttccaa 

acaccggggc gcgctgcggc tgctgttgct 

gaagaaaccc atctgagcgg ggggtacctg 

gcggttgtga gacacaagaa tcgcctgcta 

ccgacggagg agcagcagca gcagcaggag 

ccatggaacc cgagagccgg cctggaccct 

tgtatccaga actgagacgc attttgacaa 

taaagaggga gcggggggct tgtgaggcta 

taatgaccag acaccgtcct gagtgtatta 

atgagcttga tctgctggcg cagaagtatt 

agccagggga tgattttgag gaggctatta 

attgcaagta caagatcagc aaacttgtaa 

acggggccga ggtggagata gatacggagg 



gaagccaata tgataatgag ggggtggagt 60 
gcgggtgacg tagtagtgtg gcggaagtgt 120 
gcgacggatg tggcaaaagt gacgtttttg 180 
ttttcgcgcg gttttaggcg gatgttgtag 240 
ccattttcgc gggaaaactg aataagagga 300 
tagcgcgtaa tatttgtcta gggccgcggg 360 
aggtgttttt ctcaggtgtt ttccgcgttc 4 20 
tcagctgacg tgtagtgtat ttatacccgg 4 80 
cagcgagtag agttttctcc tccgagccgc 540 
ggtactggct gataatcttc cacctcctag 600 
gtatgattta gacgtgacgg cccccgaaga 660 
tcccgactct gtaatgttgg cggtgcagga 720 
gcccggttct ccggagccgc ctcacctttc 780 
cttgggtccg gtttgccacg aggctggctt 840 
tgaggagttt gtgttagatt atgtggagca 900 
tcaccggagg aatacggggg acccagatat 960 
tggcatgttt gtctacagta agtgaaaatt 1020 
ggtgtggtaa tttttttttt aatttttaca 1080 
atttttttaa aaggtcctgt gtctgaacct 1140 
gcaagaccta cccgccgtcc taaaatggcg 1200 
gtgtctagag aatgcaatag tagtacggat 1260 
cctgagatac acccggtggt cccgctgtgc 1320 
gggcgtcgcc aggctgtgga atgtatcgag 1380 
gacttgagct gtaaacgccc caggccataa 14 40 
acgcctttgt ttgctgaatg agttgatgta 1500 
ttgcatggcg tgttaaatgg ggcggggctt 1560 
ttggttacat ctgacctcat ggaggcttgg 1620 
aacttgctgg aacagagctc taacagtacc 1680 
tcccaggcaa agttagtctg cagaattaag 1740 
ttgaaatcct gtggtgagct gtttgattct 1800 
gagaaggtca tcaagacttt ggatttttcc 1860 
tttttgagtt ttataaagga taaatggagc 1920 
ctggattttc tggccatgca tctgtggaga 1980 
ctgttgtctt ccgtccgccc ggcgataata 2040 
gaagccaggc ggcggcggca ggagcagagc 2100 
cgggaatgaa tgttgtacag gtggctgaac 2160 
ttacagagga tgggcagggg ctaaaggggg 2220 
cagaggaggc taggaatcta gcttttagct 2280 
cttttcaaca gatcaaggat aattgcgcta 2340 
ccatagagca gctgaccact tactggctgc 24 00 
gggtatatgc aaaggtggca cttaggccag 24 60 
atatcaggaa ttgttgctac atttctggga 2520 
atagggtggc ctttagatgt agcatgataa 2580 
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atatgtggcc gggggtgctt ggcatggacg 
gccccaattt tagcggtacg gttttcctgg 
gcttctatgg gtttaacaat acctgtgtgg 
gtgcctttta ctgctgctgg aagggggtgg 
agaaatgcct ctttgaaagg tgtaccttgg 
gccacaatgt ggcctccgac tgtggttgct 
agcataacat ggtatgtggc aactgcgagg 
acggcaactg tcacctgctg aagaccattc 
cagtgtttga gcataacata ctgacccgct 
tgttcctacc ttaccaatgc aatttgagtc 
tgtccaaggt gaacctgaac ggggtgtttg 
ggtacgatga gacccgcacc aggtgcagac 
accagcctgt gatgctggat gtgaccgagg 
gcacccgcgc tgagtttggc tctagcgatg 
ggcgtggctt aagggtggga aagaatatat 
gttttgcagc agccgccgcc gccatgagca 
catatttgac aacgcgcatg cccccatggg 
gcattgatgg tcgccccgtc ctgcccgcaa 
ctggaacgcc gttggagact gcagcctccg 
gcgggattgt gactgacttt gctttcctga 
catccgcccg cgatgacaag ttgacggctc 
aacttaatgt cgtttctcag cagctgttgg 
cttcctcccc tcccaatgcg gtttaaaaca 
ggatcaagca agtgtcttgc tgtctttatt 
accagcggtc tcggtcgttg agggtcctgt 
tctggatgtt cagatacatg ggcataagcc 
gagcttcatg ctgcggggtg gtgttgtaga 
ggtgcctaaa aatgtctttc agtagcaagc 
tgtttacaaa gcggttaagc tgggatgggt 
actgtatttt taggttggct atgttcccag 
gaaccaccag cacagtgtat ccggtgcact 
atgcgtggaa gaacttggag acgcccttgt 
taatgatggc aatgggccca cgggcggcgg 
cgtcatagtt gtgttccagg atgagatcgt 
gggtgccaga ctgcggtata atggttccat 
tttgcatttc ccacgctttg agttcagatg 
agaaaacggt ttccggggta ggggagatca 
gcgacttacc gcagccggtg ggcccgtaaa 
taagagagct gcagctgccg tcatccctga 
tgactcgcat gttttccctg accaaatccg 
gttcttgcaa ggaagcaaag tttttcaacg 
tgagcgtttg accaagcagt tccaggcggt 
ctcgatccag catatctcct cgtttcgcgg 
tcggtgctcg tccagacggg ccagggtcat 
cgtagtctgg gtcacggtga aggggtgcgc 
gaggctggtc ctgctggtgc tgaagcgctg 
gcatttgacc atggtgtcat agtccagccc 
gcccttggag gaggcgccgc acgaggggca 
cgcgagaaat accgattccg gggagtaggc 
gcattccacg agccaggtga gctctggccg 
ctttttgatg cgtttcttac ctctggtttc 
aaggctgtcc gtgtccccgt atacagactt 
gtcctcctcg tatagaaact cggaccactc 
gaaggaggct aagtgggagg ggtagcggtc 
ggtgtgaaga cacatgtcgc cctcttcggc 
ggccacgtga ccgggtgttc ctgaaggggg 
ctcactctct tccgcatcgc tgtctgcgag 
aaaagcgggc atgacttctg cgctaagatt 
attcacctgg cccgcggtga tgcctttgag 
aatctttttg ttgtcaagct tggtggcaaa 
ggcgatggag cgcagggttt ggtttttgtc 
tagctgcacg tattcgcgcg caacgcaccg 
gggcaccagg tgcacgcgcc aaccgcggtt 
tacctctccg cgtaggcgct cgttggtcca 
tggcggtagg gggtctagct gcgtctcgtc 
gggcagcagg cgcgcgtcga agtagtctat 



gggtggttat tatgaatgta aggtttactg 2640 
ccaataccaa ccttatccta cacggtgtaa 2700 
aagcctggac cgatgtaagg gttcggggct 2760 
tgtgtcgccc caaaagcagg gcttcaatta 2820 
gtatcctgtc tgagggtaac tccagggtgc 2880 
tcatgctagt gaaaagcgtg gctgtgatta 2940 
acagggcctc tcagatgctg acctgctcgg 3000 
acgtagccag ccactctcgc aaggcctggc 3060 
gttccttgca tttgggtaac aggagggggg 3120 
acactaagat attgcttgag cccgagagca 3180 
acatgaccat gaagatctgg aaggtgctga 3240 
cctgcgagtg tggcggtaaa catattagga 3300 
agctgaggcc cgatcacttg gtgctggcct 3360 
aagatacaga ttgaggtact gaaatgtgtg 3420 
aaggtggggg tcttatgtag ttttgtatct 3480 
ccaactcgtt tgatggaagc attgtgagct 354 0 
ccggggtgcg tcagaatgtg atgggctcca 3600 
actctactac cttgacctac gagaccgtgt 3660 
ccgccgcttc agccgctgca gccaccgccc 3720 
gcccgcttgc aagcagtgca gcttcccgtt 3780 
ttttggcaca attggattct ttgacccggg 3840 
atctgcgcca gcaggtttct gccctgaagg 3900 
taaataaaaa accagactct gtttggattt 3960 
taggggtttt gcgcgcgcgg taggcccggg 4020 
gtattttttc caggacgtgg taaaggtgac 4080 
cgtctctggg gtggaggtag caccactgca 4140 
tgatccagtc gtagcaggag cgctgggcgt 4200 
tgattgccag gggcaggccc ttggtgtaag 4260 
gcatacgtgg ggatatgaga tgcatcttgg 4320 
ccatatccct ccggggattc atgttgtgca 4380 
tgggaaattt gtcatgtagc ttagaaggaa 4440 
gacctccaag attttccatg cattcgtcca 4500 
cctgggcgaa gatatttctg ggatcactaa 4560 
cataggccat ttttacaaag cgcgggcgga 4 620 
ccggcccagg ggcgtagtta ccctcacaga 4 680 
gggggatcat gtctacctgc ggggcgatga 4740 
gctgggaaga aagcaggttc ctgagcagct 4800 
tcacacctat taccgggtgc aactggtagt 4860 
gcaggggggc cacttcgtta agcatgtccc 4 920 
ccagaaggcg ctcgccgccc agcgatagca 4 980 
gtttgagacc gtccgccgta ggcatgcttt 504 0 
cccacagctc ggtcacctgc tctacggcat 5100 
gttggggcgg ctttcgctgt acggcagtag 5160 
gtctttccac gggcgcaggg tcctcgtcag 5220 
tccgggctgc gcgctggcca gggtgcgctt 5280 
ccggtcttcg ccctgcgcgt cggccaggta 534 0 
ctccgcggcg tggcccttgg cgcgcagctt 5400 
gtgcagactt ttgagggcgt agagcttggg 54 60 
atccgcgccg caggccccgc agacggtctc 5520 
ttcggggtca aaaaccaggt ttcccccatg 5580 
catgagccgg tgtccacgct cggtgacgaa 564 0 
gagaggcctg tcctcgagcg gtgttccgcg 5700 
tgagacaaag gctcgcgtcc aggccagcac 5760 
gttgtccact agggggtcca ctcgctccag 5820 
atcaaggaag gtgattggtt tgtaggtgta 5B80 
gctataaaag ggggtggggg cgcgttcgtc 594 0 
ggccagctgt tggggtgagt actccctctg 6000 
gtcagtttcc aaaaacgagg aggatttgat 6060 
ggtggccgca tccatctggt cagaaaagac 6120 
cgacccgtag agggcgttgg acagcaactt 6180 
gcgatcggcg cgctccttgg ccgcgatgtt 6240 
ccattcggga aagacggtgg tgcgctcgtc 6300 
gtgcagggtg acaaggtcaa cgctggtggc 6360 
gcagaggcgg ccgcccttgc gcgagcagaa 6420 
cggggggtct gcgtccacgg taaagacccc 6480 
cttgcatcct tgcaagtcta gcgcctgctg 654 0 
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ccatgcgcgg gcggcaagcg cgcgctcgta 
gtgggtgagc gcggaggcgt acatgccgca 
tattccaaga tatgtagggt agcatcttcc 
tagttcgtgc gagggagcga ggaggtcggg 
tcggaagact atctgcctga agatggcatg 
gacgttgaag ctggcgtctg tgagacctac 
gcgcagcttg ttgaccagct cggcggtgac 
ttccttgatg atgtcatact tatcctgtcc 
aaactcttcg cggtctttcc agtactcttg 
agagcctagc atgtagaact ggttgacggc 
tagcgcgtat gcctgcgcgg ccttccggag 
gaccatgact ttgaggtact ggtatttgaa 
gagcaaaaag tccgtgcgct ttttggaacg 
gaagagtatc tttcccgcgc gaggcataaa 
ctcggaacgg ttgttaatta cctgggcggc 
gtggcccaca atgtaaagtt ccaagaagcg 
aagttcctcg taggtgagct cttcagggga 
tgcaagatga gggttggaag cgacgaatga 
caggtggtcg cgaaaggtcc taaactggcg 
gtagaaggta agcgggtctt gttcccagcg 
cgcggcagtc actagaggct catctccgcc 
ctgcttccca aaggccccca tccaagtata 
ctcggtgcga ggatgcgagc cgatcgggaa 
gtggctattg atgtggtgaa agtagaagtc 
tttgtaaaaa cgtgcgcagt actggcagcg 
gacctgacga ccgcgcacaa ggaagcagag 
tggctggtgg tcttctactt cggctgcttg 
tacggtggat cggaccacca cgccgcgcga 
tcggagcttg atgacaacat cgcgcagatg 
cgtcaggtca ggcgggagct cctgcaggtt 
tagatccagg tgatacctaa tttccagggg 
gaggccgcat ccccgcggcg cgactacggt 
gtccttggat gatgcatcta aaagcggtga 
tccggacccg ccgggagagg gggcaggggc 
tgctgcgcgc gtaggttgct ggcgaacgcg 
cgcctctgcg tgaagacgac gggcccggtg 
tcaatttcgg tgtcgttgac ggcggcctgg 
tcttgatagg cgatctcggc catgaactgc 
ccggctcgct ccacggtggc ggcgaggtcg 
gcgttgaggc ctccctcgtt ccagacgcgg 
gcgcgcatga ccacctgcgc gagattgagc 
cgcaggcgct gaaagaggta gttgagggtg 
ataacccagc gtcgcaacgt ggattcgttg 
gcctcgtaga agtccacggc gaagttgaaa 
tcctcctcca gaagacggat gagctcggcg 
acaggggcct cttcttcttc ttcaatctcc 
tctggcggcg gtgggggagg ggggacacgg 
acaaagcgct cgatcatctc cccgcggcga 
ttctcgcggg ggcgcagttg gaagacgccg 
gggctgccat gcggcaggga tacggcgcta 
actccgccgc cgagggacct gagcgagtcc 
aaggcgtcta accagtcaca gtcgcaaggt 
cggcggtcgg ggttgtttct ggcggaggtg 
ttgagacggc ggatggtcga cagaagcacc 
aggcggtcgg ccatgcccca ggcttcgttt 
tgcatgagcc tttctaccgg cacttcttct 
tctatcgctg cggcggcggc ggagtttggc 
gtgaccccga agcccctcat cggctgaagc 
aatatggcct gctgcacctg cgtgagggta 
tggtatgcgc ccgtgttgat ggtgtaagtg 
tggtgacccg gctgcgagag ctcggtgtac 
acgtagtcgt tgcaagtccg caccaggtac 
tggcggtaga ggggccagcg tagggtggcc 
aggcgatgat atccgtagat gtacctggac 
gcgcgcggaa agtcgcggac gcggttccag 
gtcgggacgc tctggccggt caggcgcgcg 



tgggttgagt gggggacccc atggcatggg 6600 
aatgtcgtaa acgtagaggg gctctctgag 6660 
accgcggatg ctggcgcgca cgtaatcgta 6720 
accgaggttg ctacgggcgg gctgctctgc 6780 
tgagttggat gatatggttg gacgctggaa 6840 
cgcgtcacgc acgaaggagg cgtaggagtc 6900 
ctgcacgtct agggcgcagt agtccagggt 6960 
cttttttttc cacagctcgc ggttgaggac 7020 
gatcggaaac ccgtcggcct ccgaacggta 7080 
ctggtaggcg cagcatccct tttctacggg 7140 
cgaggtgtgg gtgagcgcaa aggtgtccct 7200 
gtcagtgtcg tcgcatccgc cctgctccca 7260 
cggatttggc agggcgaagg tgacatcgtt 7320 
gttgcgtgtg atgcggaagg gtcccggcac 7380 
gagcacgatc tcgtcaaagc cgttgatgtt 7440 
cgggatgccc ttgatggaag gcaatttttt 7500 
gctgagcccg tgctctgaaa gggcccagtc 7560 
gctccacagg tcacgggcca ttagcatttg 7620 
acctatggcc attttttctg gggtgatgca 7680 
gtcccatcca aggttcgcgg ctaggtctcg 7740 
gaacttcatg accagcatga agggcacgag 7800 
ggtctctaca tcgtaggtga caaagagacg 7860 
gaactggatc tcccgccacc aattggagga 7920 
cctgcgacgg gccgaacact cgtgctggct 7980 
gtgcacgggc tgtacatcct gcacgaggtt 804 0 
tgggaatttg agcccctcgc ctggcgggtt 8100 
tccttgaccg tctggctgct cgaggggagt 8160 
gcccaaagtc cagatgtccg cgcgcggcgg 8220 
ggagctgtcc atggtctgga gctcccgcgg 8280 
tacctcgcat agacgggtca gggcgcgggc 8340 
ctggttggtg gcggcgtcga tggcttgcaa 8400 
accgcgcggc gggcggtggg ccgcgggggt 84 60 
cgcgggcgag cccccggagg tagggggggc 8520 
acgtcggcgc cgcgcgcggg caggagctgg 8580 
acgacgcggc ggttgatctc ctgaatctgg 8640 
agcttgagcc tgaaagagag ttcgacagaa 8700 
cgcaaaatct cctgcacgtc tcctgagttg 8760 
tcgatctctt cctcctggag atctccgcgt 8820 
ttggaaatgc gggccatgag ctgcgagaag 8880 
ctgtagacca cgcccccttc ggcatcgcgg 8940 
tccacgtgcc gggcgaagac ggcgtagttt 9000 
gtggcggtgt gttctgccac gaagaagtac 9060 
atatccccca aggcctcaag gcgctccatg 9120 
aactgggagt tgcgcgccga cacggttaac 9180 
acagtgtcgc gcacctcgcg ctcaaaggct 9240 
tcttccataa gggcctcccc ttcttcttct 9300 
cggcgacgac ggcgcaccgg gaggcggtcg 9360 
cggcgcatgg tctcggtgac ggcgcggccg 9420 
cccgtcatgt cccggttatg ggttggcggg 9480 
acgatgcatc tcaacaattg ttgtgtaggt 9540 
gcatcgaccg gatcggaaaa cctctcgaga 9600 
aggctgagca ccgtggcggg cggcagcggg 9660 
ctgctgatga tgtaattaaa gtaggcggtc 9720 
atgtccttgg gtccggcctg ctgaatgcgc 9780 
tgacatcggc gcaggtcttt gtagtagtct 9840 
tctccttcct cttgtcctgc atctcttgca 9900 
cgtaggtggc gccctcttcc tcccatgcgt 9960 
agggctaggt cggcgacaac gcgctcggct 10020 
gactggaagt catccatgtc cacaaagcgg 10080 
cagttggcca taacggacca gttaacggtc 1014 0 
ctgagacgcg agtaagccct cgagtcaaat 10200 
tggtatccca ccaaaaagtg cggcggcggc 10260 
ggggctccgg gggcgagatc ttccaacata 10320 
atccaggtga tgccggcggc ggtggtggag 10380 
atgttgcgca gcggcaaaaa gtgctccatg 10440 
caatcgttga cgctctagcg tgcaaaagga 10500 
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gagcctgtaa gcgggcactc ttccgtggtc 
ggacgaccgg ggttcgagcc ccgtatccgg 
cgcgtgtcga acccaggtgt gcgacgtcag 
tccaggcgcg gcggctgctg cgctagcttt 
gttaggctgg aaagcgaaag cattaagtgg 
caagggttga gtcgcgggac ccccggttcg 
ggggtttgcc tccccgtcat gcaagacccc 
gccccttttt tgcttttccc agatgcatcc 
gcagcggcaa gagcaagagc agcggcagac 
gtcaggaggg gcgacatccg cggttgacgc 
gcgccgggcc cggcactacc tggacttgga 
gccctctcct gagcggtacc caagggtgca 
gccgcggcag aacctgtttc gcgaccgcga 
aaagttccac gcagggcgcg agctgcggca 
ggaggacttt gagcccgacg cgcgaaccgg 
cgccgacctg gtaaccgcat acgagcagac 
ctttaacaac cacgtgcgta cgcttgtggc 
tctgtgggac tttgtaagcg cgctggagca 
gctgttcctt atagtgcagc acagcaggga 
catagtagag cccgagggcc gctggctgct 
ggtgcaggag cgcagcttga gcctggctga 
tagcctgggc aagttttacg cccgcaagat 
ggaggtaaag atcgaggggt tctacatgcg 
cgacctgggc gtttatcgca acgagcgcat 
cgagctcagc gaccgcgagc tgatgcacag 
cggcgataga gaggccgagt cctactttga 
ccgacgcgcc ctggaggcag ctggggccgg 
tggcaacgtc ggcggcgtgg aggaatatga 
cgagtactaa gcggtgatgt ttctgatcag 
cgggcggcgc tgcagagcca gccgtccggc 
atggaccgca tcatgtcgct gactgcgcgc 
gccaaccggc tctccgcaat tctggaagcg 
gagaaggtgc tggcgatcgt aaacgcgctg 
gccggcctgg tctacgacgc gctgcttcag 
cagaccaacc tggaccggct ggtgggggat 
gcgcagcagc agggcaacct gggctccatg 
cccgccaacg tgccgcgggg acaggaggac 
atggtgactg agacaccgca aagtgaggtg 
accagtagac aaggcctgca gaccgtaaac 
ctgtgggggg tgcgggctcc cacaggcgac 
aactcgcgcc tgttgctgct gctaatagcg 
gacacatacc taggtcactt gctgacactg 
gacgagcata ctttccagga gattacaagt 
ggcagcctgg aggcaaccct aaactacctg 
ttgcacagtt taaacagcga ggaggagcgc 
cttaacctga tgcgcgacgg ggtaacgccc 
atggaaccgg gcatgtatgc ctcaaaccgg 
catcgcgcgg ccgccgtgaa ccccgagtat 
ctaccgcccc ctggtttcta caccggggga 
ctctgggacg acatagacga cagcgtgttt 
caacagcgcg agcaggcaga ggcggcgctg 
ttgtccgatc taggcgctgc ggccccgcgg 
atagggtctc ttaccagcac tcgcaccacc 
ctaaacaact cgctgctgca gccgcagcgc 
aacgggatag agagcctagt ggacaagatg 
agggacgtgc caggcccgcg cccgcccacc 
ctggtgtggg aggacgatga ctcggcagac 
ggcaacccgt ttgcgcacct tcgccccagg 
atgatgcaaa ataaaaaact caccaaggcc 
cccttagtat gcggcgcgcg gcgatgtatg 
tggtgagcgc ggcgccagtg gcggcggcgc 
cgccgtttgt gcctccgcgg tacctgcggc 
ctgagttggc acccctattc gacaccaccc 
atgtggcatc cctgaactac cagaacgacc 
acaatgacta cagcccgggg gaggcaagca 
actggggcgg cgacctgaaa accatcctgc 



PCT/USOO/18971 



tggtggataa attcgcaagg gtatcatggc 10560 

ccgtccgccg tgatccatgc ggttaccgcc 10620 

acaacggggg agtgctcctt ttggcttcct 10680 

tttggccact ggccgcgcgc agcgtaagcg 10740 

ctcgctccct gtagccggag ggttattttc 1OBO0 

agtctcggac cggccggact gcggcgaacg 10B60 

gcttgcaaat tcctccggaa acagggacga 10920 

ggtgctgcgg cagatgcgcc cccctcctca 10980 

atgcagggca ccctcccctc ctcctaccgc 11040 

ggcagcagat ggtgattacg aacccccgcg 11100 

ggagggcgag ggcctggcgc ggctaggagc 11160 

gctgaagcgt gatacgcgtg aggcgtacgt 11220 

gggagaggag cccgaggaga tgcgggatcg 11280 

tggcctgaat cgcgagcggt tgctgcgcga 11340 

gattagtccc gcgcgcgcac acgtggcggc 11400 

ggtgaaccag gagattaact ttcaaaaaag 11460 

gcgcgaggag gtggctatag gactgatgca 11520 

aaacccaaat agcaagccgc tcatggcgca 11580 

caacgaggca ttcagggatg cgctgctaaa 11640 

cgatttgata aacatcctgc agagcatagt 11700 

caaggtggcc gccatcaact attccatgct 11760 

ataccatacc ccttacgttc ccatagacaa 11820 

catggcgctg aaggtgctta ccttgagcga 11880 

ccacaaggcc gtgagcgtga gccggcggcg 11940 

cctgcaaagg gccctggctg gcacgggcag 12000 

cgcgggcgct gacctgcgct gggccccaag 12060 

acctgggctg gcggtggcac ccgcgcgcgc 12120 

cgaggacgat gagtacgagc cagaggacgg 12180 

atgatgcaag acgcaacgga cccggcggtg 12240 

cttaactcca cggacgactg gcgccaggtc 12300 

aatcctgacg cgttccggca gcagccgcag 12360 

gtggtcccgg cgcgcgcaaa ccccacgcac 12 420 

gccgaaaaca gggccatccg gcccgacgag 12480 

cgcgtggctc gttacaacag cggcaacgtg 12540 

gtgcgcgagg ccgtggcgca gcgtgagcgc 12 600 

gttgcactaa acgccttcct gagtacacag 12660 

tacaccaact ttgtgagcgc actgcggcta 12720 

taccagtctg ggccagacta ttttttccag 12780 

ctgagccagg ctttcaaaaa cttgcagggg 12840 

cgcgcgaccg tgtctagctt gctgacgccc 12900 

cccttcacgg acagtggcag cgtgtcccgg 12960 

taccgcgagg ccataggtca ggcgcatgtg 13020 

gtcagccgcg cgctggggca ggaggacacg 13080 

ctgaccaacc ggcggcagaa gatcccctcg 13140 

attttgcgct acgtgcagca gagcgtgagc 13200 

agcgtggcgc tggacatgac cgcgcgcaac 13260 

ccgtttatca accgcctaat ggactacttg 13320 

ttcaccaatg ccatcttgaa cccgcactgg 13380 

ttcgaggtgc ccgagggtaa cgatggattc 13440 

tccccgcaac cgcagaccct gctagagttg 13500 

cgaaaggaaa gcttccgcag gccaagcagc 13560 

tcagatgcta gtagcccatt tccaagcttg 13620 

cgcccgcgcc tgctgggcga ggaggagtac 13680 

gaaaaaaacc tgcctccggc atttcccaac 13740 

agtagatgga agacgtacgc gcaggagcac 13800 

cgtcgtcaaa ggcacgaccg tcagcggggt 13860 

gacagcagcg tcctggattt gggagggagt 13920 

ctggggagaa tgttttaaaa aaaaaaaagc 13980 

atggcaccga gcgttggttt tcttgtattc 14040 

aggaaggtcc tcctccctcc tacgagagtg 14100 

tgggttctcc cttcgatgct cccctggacc 14160 

ctaccggggg gagaaacagc atccgttact 14220 

gtgtgtacct ggtggacaac aagtcaacgg 14280 

acagcaactt tctgaccacg gtcattcaaa 14340 

cacagaccat caatcttgac gaccggtcgc 14 400 

ataccaacat gccaaatgtg aacgagttca 14460 
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tgtttaccaa taagtttaag gcgcgggtga 
aggtggagct gaaatacgag tgggtggagt 
ccatgaccat agaccttatg aacaacgcga 
agaacggggt • tctggaaagc gacatcgggg 
ggtttgaccc cgtcactggt cttgtcatgc 
cagacatcat tttgctgcca ggatgcgggg 
tgttgggcat ccgcaagcgg caacccttcc 
tggagggtgg taacattccc gcactgttgg 
atgacaccga acagggcggg ggtggcgcag 
aagagaactc caacgcggca gccgcggcaa 
ccattcgcgg cgacaccttt gccacacggg 
cggccgaagc tgccgccccc gctgcgcaac 
tgatcaaacc cctgacagag gacagcaaga 
gcaccttcac ccagtaccgc agctggtacc 
gaatccgctc atggaccctg ctttgcactc 
actggtcgtt gccagacatg atgcaagacc 
gcaactttcc ggtggtgggc gccgagctgt 
accaggccgt ctactcccaa ctcatccgcc 
gctttcccga gaaccagatt ttggcgcgcc 
aaaacgttcc tgctctcaca gatcacggga 
tccagcgagt gaccattact gacgccagac 
tgggcatagt ctcgccgcgc gtcctatcga 
ttatatcgcc cagcaataac acaggctggg 
gggccaagaa gcgctccgac caacacccag 
ggggcgcgca caaacgcggc cgcactgggc 
tggtggagga ggcgcgcaac tacacgccca 
ccattcagac cgtggtgcgc ggagcccggc 
gcgtagcacg tcgccaccgc cgccgacccg 
tgcttaaccg cgcacgtcgc accggccgac 
ccgcgggtat tgtcactgtg ccccccaggt 
cggccattag tgctatgact cagggtcgca 
ttagcggcct gcgcgtgccc gtgcgcaccc 
actacttaga ctcgtactgt tgtatgtatc 
ccaagcgcaa aatcaaagaa gagatgctcc 
cgaagaagga agagcaggat tacaagcccc 
aagatgatga tgatgaactt gacgacgagg 
gacgggtaca gtggaaaggt cgacgcgtaa 
tctttacgcc cggtgagcgc tccacccgca 
gcgacgagga cctgcttgag caggccaacg 
ggcataagga catgctggcg ttgccgctgg 
ccgtaacact gcagcaggtg ctgcccgcgc 
agcgcgagtc tggtgacttg gcacccaccg 
tggaagatgt cttggaaaaa atgaccgtgg 
ggccaatcaa gcaggtggcg ccgggactgg 
ctaccagtag caccagtatt gccaccgcca 
ttgcctcagc ggtggcggat gccgcggtgc 
ctacggaggt gcaaacggac ccgtggatgt 
gttcgaggaa gtacggcgcc gccagcgcgc 
ttgcgcctac ccccggctat cgtggctaca 
gacgccgaac caccactgga acccgccgcc 
cgatttccgt gcgcagggtg gctcgcgaag 
gctaccaccc cagcatcgtt taaaagccgg 
cctgccgcct ccgtttcccg gtgccgggat 
tggccggcca cggcctgacg ggcggcatgc 
cgcaccgtcg catgcgcggc ggtatcctgc 
ttggcgccgt gcccggaatt gcatccgtgg 
caagttgcat gtggaaaaat caaaataaaa 
taactatttt gtagaatgga agacatcaac 
cgcccgttca tgggaaactg gcaagatatc 
agctggggct cgctgtggag cggcattaaa 
agcaaggcct ggaacagcag cacaggccag 
ttccaacaaa aggtggtaga tggcctggcc 
aaccaggcag tgcaaaataa gattaacagt 
cctccaccgg ccgtggagac agtgtctcca 
gacagggaag aaactctggt gacgcaaata 
aagcaaggcc tgcccaccac ccgtcccatc 



tggtgtcgcg cttgcctact aaggacaatc 14520 
tcacgctgcc cgagggcaac tactccgaga 14580 
tcgtggagca ctacttgaaa gtgggcagac 14640 
taaagtttga cacccgcaac ttcagactgg 14700 
ctggggtata tacaaacgaa gccttccatc 14760 
tggacttcac ccacagccgc ctgagcaact 14820 
aggagggctt taggatcacc tacgatgatc 14880 
atgtggacgc ctaccaggcg agcttgaaag 14940 
gcggcagcaa cagcagtggc agcggcgcgg 15000 
tgcagccggt ggaggacatg aacgatcatg 15060 
ctgaggagaa gcgcgctgag gccgaagcag 15120 
ccgaggtcga gaagcctcag aagaaaccgg 15180 
aacgcagtta caacctaata agcaatgaca 15240 
ttgcatacaa ctacggcgac cctcagaccg 15300 
ctgacgtaac ctgcggctcg gagcaggtct 15360 
ccgtgacctt ccgctccacg cgccagatca 15420 
tgcccgtgca ctccaagagc ttctacaacg 15480 
agtttacctc tctgacccac gtgttcaatc 15540 
cgccagcccc caccatcacc accgtcagtg 15600 
cgctaccgct gcgcaacagc atcggaggag 15660 
gccgcacctg cccctacgtt tacaaggccc 15720 
gccgcacttt ttgagcaagc atgtccatcc 15780 
gcctgcgctt cccaagcaag atgtttggcg 15840 
tgcgcgtgcg cgggcactac cgcgcgccct 15900 
gcaccaccgt cgatgacgcc atcgacgcgg 15960 
cgccgccacc agtgtccaca gtggacgcgg 16020 
gctatgctaa aatgaagaga cggcggaggc 16080 
gcactgccgc ccaacgcgcg gcggcggccc 16140 
gggcggccat gcgggccgct cgaaggctgg 16200 
ccaggcgacg agcggccgcc gcagcagccg 16260 
ggggcaacgt gtattgggtg cgcgactcgg 16320 
gccccccgcg caactagatt gcaagaaaaa 16380 
cagcggcggc ggcgcgcaac gaagctatgt 16440 
aggtcatcgc gccggagatc tatggccccc 16500 
gaaagctaaa gcgggtcaaa aagaaaaaga 16560 
tggaactgct gcacgctacc gcgcccaggc 16620 
aacgtgtttt gcgacccggc accaccgtag 16680 
cctacaagcg cgtgtatgat gaggtgtacg 16740 
agcgcctcgg ggagtttgcc tacggaaagc 16800 
acgagggcaa cccaacacct agcctaaagc 16860 
ttgcaccgtc cgaagaaaag cgcggcctaa 16920 
tgcagctgat ggtacccaag cgccagcgac 16980 
aacctgggct ggagcccgag gtccgcgtgc 17040 
gcgtgcagac cgtggacgtt cagataccca 17100 
cagagggcat ggagacacaa acgtccccgg 17160 
aggcggtcgc tgcggccgcg tccaagacct 17220 
ttcgcgtttc agccccccgg cgcccgcgcg 17280 
tactgcccga atatgcccta catccttcca 17340 
cctaccgccc cagaagacga gcaactaccc 17400 
gccgtcgccg tcgccagccc gtgctggccc 17460 
gaggcaggac cctggtgctg ccaacagcgc 17520 
tctttgtggt tcttgcagat atggccctca 17580 
tccgaggaag aatgcaccgt aggaggggca 17640 
gtcgtgcgca ccaccggcgg cggcgcgcgt 17700 
ccctccttat tccactgatc gccgcggcga 17760 
ccttgcaggc gcagagacac tgattaaaaa 17820 
agtctggact ctcacgctcg cttggtcctg 17880 
tttgcgtctc tggccccgcg acacggctcg 17940 
ggcaccagca atatgagcgg tggcgccttc 18000 
aatttcggtt ccaccgttaa gaactatggc 18060 
atgctgaggg ataagttgaa agagcaaaat 18120 
tctggcatta gcggggtggt ggacctggcc 18180 
aagcttgatc cccgccctcc cgtagaggag 18240 
gaggggcgtg gcgaaaagcg tccgcgcccc 18300 
gacgagcctc cctcgtacga ggaggcacta 18360 
gcgcccatgg ctaccggagt gctgggccag 18420 
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cacacacccg taacgctgga cctgcctccc cccgccgaca cccagcagaa acctgtgctg 18480 
ccaggcccga ccgccgttgt tgtaacccgt cctagccgcg cgtccctgcg ccgcgccgcc 18540 
agcggtccgc gatcgttgcg gcccgtagcc agtggcaact ggcaaagcac actgaacagc 18600 
atcgtgggtc tgggggtgca atccctgaag cgccgacgat gcttctgaat agctaacgtg 18660 
tcgtatgtgt gtcatgtatg cgtccatgtc gccgccagag gagctgctga gccgccgcgc 18720 
gcccgctttc caagatggct accccttcga tgatgccgca gtggtcttac atgcacatct 18780 
cgggccagga cgcctcggag racctgagcc ccgggctggt gcagtttgcc cgcgccaccg 18840 
agacgtactt cagcctgaat aacaagttta gaaaccccac ggtggcgcct acgcacgacg 18900 
tgaccacaga ccggtcccag cgtttgacgc tgcggttcat ccctgtggac cgtgaggata 18960 
ctgcgtactc gtacaaggcg cggttcaccc tagctgtggg tgataaccgt gtgctggaca 19020 
tggcttccac gtactttgac atccgcggcg tgctggacag gggccctact tttaagccct 19080 
actctggcac tgcctacaac gccctggctc ccaagggtgc cccaaatcct tgcgaatggg 19140 
atgaagctgc tactgctctt gaaataaacc tagaagaaga ggacgatgac aacgaagacg 19200 
aagtagacga gcaagctgag cagcaaaaaa ctcacgtatt tgggcaggcg ccttattctg 19260 
gtataaatat tacaaaggag ggtattcaaa taggtgtcga aggtcaaaca cctaaatatg 19320 
ccgataaaac atttcaacct gaacctcaaa taggagaatc tcagtggtac gaaactgaaa 19380 
ttaatcatgc agctgggaga gtccttaaaa agactacccc aatgaaacca tgttacggtt 19440 
catatgcaaa acccacaaat gaaaatggag ggcaaggcat tcttgtaaag caacaaaatg 19500 
gaaagctaga aagtcaagtg gaaatgcaat ttttctcaac tactgaggcg accgcaggca 19560 
atggtgataa cttgactcct aaagtggtat tgtacagtga agatgtagat atagaaaccc 19620 
cagacactca tatttcttac atgcccacta ttaaggaagg taactcacga gaactaatgg 19680 
gccaacaatc tatgcccaac aggcctaatt acattgcttt tagggacaat tttattggtc 19740 
taatgtatta caacagcacg ggtaatatgg gtgttctggc gggccaagca tcgcagttga 19800 
atgctgttgt agatttgcaa gacagaaaca cagagctttc ataccagctt ttgcttgatt 19860 
ccattggtga tagaaccagg tacttttcta tgtggaatca ggctgttgac agctatgatc 19920 
cagatgttag aattattgaa aatcatggaa ctgaagatga acttccaaat tactgctttc 19980 
cactgggagg tgtgattaat acagagactc ttaccaaggt aaaacctaaa acaggtcagg 20040 
aaaatggatg ggaaaaagat gctacagaat tttcagataa aaatgaaata agagttggaa 20100 
ataattttgc catggaaatc aatctaaatg ccaacctgtg gagaaatttc ctgtactcca 20160 
acatagcgct gtatttgccc gacaagctaa agtacagtcc ttccaacgta aaaatttctg 20220 
ataacccaaa cacctacgac tacatgaaca agcgagtggt ggctcccggg ttagtggact 20280 
gctacattaa ccttggagca cgctggtccc ttgactatat ggacaacgtc aacccattta 20340 
accaccaccg caatgctggc ctgcgctacc gctcaatgtt gctgggcaat ggtcgctatg 20400 
tgcccttcca catccaggtg cctcagaagt tctttgccat taaaaacctc cttctcctgc 204 60 
cgggctcata cacctacgag tggaacttca ggaaggatgt taacatggtt ctgcagagct 20520 
ccctaggaaa tgacctaagg gttgacggag ccagcattaa gtttgatagc atttgccttt 20580 
acgccacctt cttccccatg gcccacaaca ccgcctccac gcttgaggcc atgcttagaa 20640 
acgacaccaa cgaccagtcc tttaacgact atctctccgc cgccaacatg ctctacccta 20700 
tacccgccaa cgctaccaac gtgcccatat ccatcccctc ccgcaactgg gcggctttcc 207 60 
gcggctgggc cttcacgcgc cttaagacta aggaaacccc atcactgggc tcgggctacg 20820 
acccttatta cacctactct ggctctatac cctacctaga tggaaccttt tacctcaacc 20880 
acacctttaa gaaggtggcc attacctttg actcttctgt cagctggcct ggcaatgacc 20940 
gcctgcttac ccccaacgag tttgaaatta agcgctcagt tgacggggag ggttacaacg 21000 
ttgcccagtg taacatgacc aaagactggt tcctggtaca aatgctagct aactacaaca 21060 
ttggctacca gggcttctat ' atcccagaga gctacaagga ccgcatgtac tccttcttta 21120 
gaaacttcca gcccatgagc cgtcaggtgg tggatgatac taaatacaag gactaccaac 21180 
aggtgggcat cctacaccaa cacaacaact ctggatttgt tggctacctt gcccccacca 21240 
tgcgcgaagg acaggcctac cctgctaact tcccctatcc gcttataggc aagaccgcag 21300 
ttgacagcat tacccagaaa aagtttcttt gcgatcgcac cctttggcgc atcccattct 21360 
ccagtaactt tatgtccatg ggcgcactca cagacctggg ccaaaacctt ctctacgcca 21420 
actccgccca cgcgctagac atgacttttg aggtggatcc catggacgag cccacccttc 21480 
tttatgtttt gtttgaagtc tttgacgtgg tccgtgtgca ccggccgcac cgcggcgtca 21540 
tcgaaaccgt gtacctgcgc acgcccttct cggccggcaa cgccacaaca taaagaagca 21600 
agcaacatca acaacagctg ccgccatggg ctccagtgag caggaactga aagccattgt 21660 
caaagatctt ggttgtgggc catatttttt gggcacctat gacaagcgct ttccaggctt 21720 
tgtttctcca cacaagctcg cctgcgccat agtcaatacg gccggtcgcg agactggggg 21780 
cgtacactgg atggcctttg cctggaaccc gcactcaaaa acatgctacc tctttgagcc 21840 
ctttggcttt tctgaccagc gactcaagca ggtttaccag tttgagtacg agtcactcct 21900 
gcgccgtagc gccattgctt cttcccccga ccgctgtata acgctggaaa agtccaccca 21960 
aagcgtacag gggcccaact cggccgcctg tggactattc tgctgcatgt ttctccacgc 22020 
ctttgccaac tggccccaaa ctcccatgga tcacaacccc accatgaacc ttattaccgg 22080 
ggtacccaac tccatgctca acagtcccca ggtacagccc accctgcgtc gcaaccagga 22140 
acagctctac agcttcctgg agcgccactc gccctacttc cgcagccaca gtgcgcagat 22200 
taggagcgcc acttcttttt gtcacttgaa aaacatgtaa aaataatgta ctagagacac 22260 
tttcaataaa ggcaaatgct tttatttgta cactctcggg tgattattta cccccaccct 22320 
tgccgtctgc gccgtttaaa aatcaaaggg gttctgccgc gcatcgctat gcgccactgg 22380 
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cagggacacg ttgcgatact ggtgtttagt gctccactta aactcaggca caaccatccg 22440 
cggcagctcg gtgaagtttt cactccacag gctgcgcacc atcaccaacg cgtttagcag 22500 
gtcgggcgcc gatatcttga agtcgcagtt ggggcctccg ccctgcgcgc gcgagttgcg 22560 
atacacaggg ttgcagcact gg'aacactat cagcgccggg tggtgcacgc tggccagcac 22 620 
gctcttgtcg gagatcagat ccgcgtccag gtcctccgcg ttgctcaggg cgaacggagt 22680 
caactttggt agctgccttc ccaaaaaggg cgcgtgccca ggctttgagt tgcactcgca 22740 
ccgtagtggc atcaaaaggt gaccgtgccc ggtctgggcg ttaggataca gcgcctgcat 22800 
aaaagccttg atctgcttaa aagccacctg agcctttgcg ccttcagaga agaacatgcc 22860 
gcaagacttg ccggaaaact gattggccgg acaggccgcg tcgtgcacgc agcaccttgc 22920 
gtcggtgttg gagatctgca ccacatttcg gccccaccgg ttcttcacga tcttggcctt 22 980 
gctagactgc tccttcagcg cgcgctgccc gttttcgctc gtcacatcca tttcaatcac 23040 
gtgctcctta tttatcataa tgcttccgtg tagacactta agctcgcctt cgatctcagc 23100 
gcagcggtgc agccacaacg cgcagcccgt gggctcgtga tgcttgtagg tcacctctgc 23160 
aaacgactgc aggtacgcct gcaggaatcg ccccatcatc gtcacaaagg tcttgttgct 23220 
ggtgaaggtc agctgcaacc cgcggtgctc ctcgttcagc caggtcttgc atacggccgc 23280 
cagagcttcc acttggtcag gcagtagttt gaagttcgcc tttagatcgt tatccacgtg 23340 
gtacttgtcc atcagcgcgc gcgcagcctc catgcccttc tcccacgcag acacgatcgg 234 00 
cacactcagc gggttcatca ccgtaatttc actttccgct tcgctgggct cttcctcttc 234 60 
ctcttgcgtc cgcataccac gcgccactgg gtcgtcttca ttcagccgcc gcactgtgcg 23520 
cttacctcct ttgccatgct tgattagcac cggtgggttg ctgaaaccca ccatttgtag 23580 
cgccacatct tctctttctt cctcgctgtc cacgattacc tctggtgatg gcgggcgctc 23640 
gggcttggga gaagggcgct tctttttctt cttgggcgca atggccaaat ccgccgccga 23700 
ggtcgatggc cgcgggctgg gtgtgcgcgg caccagcgcg tcttgtgatg agtcttcctc 237 60 
gtcctcggac tcgatacgcc gcctcatccg cttttttggg ggcgcccggg gaggcggcgg 23820 
cgacggggac ggggacgaca cgtcctccat ggttggggga cgtcgcgccg caccgcgtcc 23880 
gcgctcgggg gtggtttcgc gctgctcctc ttcccgactg gccatttcct tctcctatag 23940 
gcagaaaaag atcatggagt cagtcgagaa gaaggacagc ctaaccgccc cctctgagtt 24000 
cgccaccacc gcctccaccg atgccgccaa cgcgcctacc accttccccg tcgaggcacc 24060 
cccgcttgag gaggaggaag tgattatcga gcaggaccca ggttttgtaa gcgaagacga 24120 
cgaggaccgc tcagtaccaa cagaggataa aaagcaagac caggacaacg cagaggcaaa 24180 
cgaggaacaa gtcgggcggg gggacgaaag gcatggcgac tacctagatg tgggagacga 242 40 
cgtgctgttg aagcatctgc agcgccagtg cgccattatc tgcgacgcgt tgcaagagcg 24300 
cagcgatgtg cccctcgcca tagcggatgt cagccttgcc tacgaacgcc acctattctc 243 60 
accgcgcgta ccccccaaac gccaagaaaa cggcacatgc gagcccaacc cgcgcctcaa 244 20 
cttctacccc gtatttgccg tgccagaggt gcttgccacc tatcacatct ttttccaaaa 244 80 
ctgcaagata cccctatcct gccgtgccaa ccgcagccga gcggacaagc agctggcctt 24540 
gcggcagggc gctgtcatac ctgatatcgc ctcgctcaac gaagtgccaa aaatctttga 24 600 
gggtcttgga cgcgacgaga agcgcgcggc aaacgctctg caacaggaaa acagcgaaaa 24660 
tgaaagtcac tctggagtgt tggtggaact cgagggtgac aacgcgcgcc tagccgtact 24720 
aaaacgcagc atcgaggtca cccactttgc ctacccggca cttaacctac cccccaaggt 247 80 
catgagcaca gtcatgagtg agctgatcgt gcgccgtgcg cagcccctgg agagggatgc 24840 
aaatttgcaa gaacaaacag aggagggcct acccgcagtt ggcgacgagc agctagcgcg 24900 
ctggcttcaa acgcgcgagc ctgccgactt ggaggagcga cgcaaactaa tgatggccgc 24960 
agtgctcgtt accgtggagc ttgagtgcat gcagcggttc tttgctgacc cggagatgca 25020 
gcgcaagcta gaggaaacat tgcactacac ctttcgacag ggctacgtac gccaggcctg 25080 
caagatctcc aacgtggagc tctgcaacct ggtctcctac cttggaattt tgcacgaaaa 25140 
ccgccttggg caaaacgtgc ttcattccac gctcaagggc gaggcgcgcc gcgactacgt 25200 
ccgcgactgc gtttacttat ttctatgcta cacctggcag acggccatgg gcgtttggca 252 60 
gcagtgcttg gaggagtgca acctcaagga gctgcagaaa ctgctaaagc aaaacttgaa 25320 
ggacctatgg acggccttca acgagcgctc cgtggccgcg cacctggcgg acatcatttt 25380 
ccccgaacgc ctgcttaaaa ccctgcaaca gggtctgcca gacttcacca gtcaaagcat 254 40 
gttgcagaac tttaggaact ttatcctaga gcgctcagga atcttgcccg ccacctgctg 25500 
tgcacttcct agcgactttg tgcccattaa gtaccgcgaa tgccctccgc cgctttgggg 25560 
ccactgctac cttctgcagc tagccaacta ccttgcctac cactctgaca taatggaaga 25620 
cgtgagcggt gacggtctac tggagtgtca ctgtcgctgc aacctatgca ccccgcaccg 25680 
ctccctggtt tgcaattcgc agctgcttaa cgaaagtcaa attatcggta cctttgagct 257 40 
gcagggtccc tcgcctgacg aaaagtccgc ggctccgggg ttgaaactca ctccggggct 25800 
gtggacgtcg gcttaccttc gcaaatttgt acctgaggac taccacgccc acgagattag 25860 
gttctacgaa gaccaatccc gcccgccaaa tgcggagctt accgcctgcg tcattaccca 25920 
gggccacatt cttggccaat tgcaagccat caacaaagcc cgccaagagt ttctgctacg 25980 
aaagggacgg ggggtttact tggaccccca gtccggcgag gagctcaacc caatcccccc 2 6040 
gccgccgcag ccctatcagc agcagccgcg ggcccttgct tcccaggatg gcacccaaaa 26100 
agaagctgca gctgccgccg ccacccacgg acgaggagga atactgggac agtcaggcag 26160 
aggaggtttt ggacgaggag gaggaggaca tgatggaaga ctgggagagc ctagacgagg 26220 
aagcttccga ggtcgaagag gtgtcagacg aaacaccgtc accctcggtc gcattcccct 26280 
cgccggcgcc ccagaaatcg gcaaccggtt ccagcatggc tacaacctcc gctcctcagg 26340 
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cgccgccggc actgcccgtt cgccgaccca accgtagatg ggacaccact ggaaccaggg 264O0 
ccggtaagtc caagcagccg ccgccgttag cccaagagca acaacagcgc caaggctacc 26460 
gctcatggcg cgggcacaag aacgccatag ttgcttgctt gcaagactgt gggggcaaca 26520 
tctccttcgc ccgccgcttt cttctctacc atcacggcgt ggccttcccc cgtaacatcc 26580 
tgcattacta ccgtcatctc tacagcccat actgcaccgg cggcagcggc agcggcagca 26640 
acagcagcgg ccacacagaa gcaaaggcga ccggatagca agactctgac aaagcccaag 26700 
aaatccacag cggcggcagc agcaggagga ggagcgctgc gtctggcgcc caacgaaccc 26760 
gtatcgaccc gcgagcttag aaacaggatt tttcccactc tgtatgctat atttcaacag 26820 
agcaggggcc aagaacaaga gctgaaaata aaaaacaggt ctctgcgatc cctcacccgc 26880 
agctgcctgt atcacaaaag cgaagatcag cttcggcgca cgctggaaga cgcggaggct 26940 
ctcttcagta aatactgcgc gctgactctt aaggactagt ttcgcgccct ttctcaaatt 27000 
taagcgcgaa aactacgtca tctccagcgg ccacacccgg cgccagcacc tgtcgtcagc 27060 
gccattatga gcaaggaaat tcccacgccc tacatgtgga gttaccagcc acaaatggga 27120 
cttgcggctg gagctgccca agactactca acccgaataa actacatgag cgcgggaccc 27180 
cacatgatat cccgggtcaa cggaatccgc gcccaccgaa accgaattct cttggaacag 27240 
gcggctatta ccaccacacc tcgtaataac cttaatcccc gtagttggcc cgctgccctg 27300 
gtgtaccagg aaagtcccgc tcccaccact gtggtacttc ccagagacgc ccaggccgaa 27360 
gttcagatga ctaactcagg ggcgcagctt gcgggcggct ttcgtcacag ggtgcggtcg 27420 
cccgggcagg gtataactca cctgacaatc agagggcgag gtattcagct caacgacgag 27480 
tcggtgagct cctcgcttgg tctccgtccg gacgggacat ttcagatcgg cggcgccggc 27540 
cgtccttcat tcacgcctcg tcaggcaatc ctaactctgc agacctcgtc ctctgagccg 27 600 
cgctctggag gcattggaac tctgcaattt attgaggagt ttgtgccatc ggtctacttt 27660 
aaccccttct cgggacctcc cggccactat ccggatcaat ttattcctaa ctttgacgcg 27720 
gtaaaggact cggcggacgg ctacgactga taattaagtg gagaggcaga gcaactgcgc 27780 
ctgaaacacc tggtccactg tcgccgccac aagtgctttg cccgcgactc cggtgagttt 27840 
tgctactttg aattgcccga ggatcatatc gaggatcttt gttgccatct ctgtgctgag 27900 
tataataaat acagaaatta aaatatactg gggctcctat cgccatcctg taaacgccac 27960 
cgtcttcacc cgcccaagca aaccaaggcg aaccttacct ggtactttta acatctctcc 28020 
ctctgtgatt tacaacagtt tcaacccaga cggagtgagt ctacgagaga acctctccga 28080 
gctcagctac tccatcagaa aaaacaccac cctccttacc tgccgggaac gtacccttaa 28140 
ttaaaagtca ggcttcctgg atgtcagcat ctgactttgg ccagcacctg tcccgcggat 28200 
ttgttccagt ccaactacag cgacccaccc taacagagat gaccaacaca accaacgcgg 28260 
ccgccgctac cggacttaca tctaccacaa atacacccca agtttctgcc tttgtcaata 28320 
actgggataa cttgggcatg tggtggttct ccatagcgct tatgtttgta tgccttatta 28380 
ttatgtggct catctgctgc ctaaagcgca aacgcgcccg accacccatc tatagtccca 28440 
tcattgtgct acacccaaac aatgatggaa tccatagatt ggacggactg aaacacatgt 28500 
tcttttctct tacagtatga ttaaatgaga ttaattaagg aatttctgtc cagtttattc 28560 
agcagcacct ccttgccctc ctcccagctc tggtattgca gcttcctcct ggctgcaaac 28620 
tttctccaca atctaaatgg aatgtcagtt tcctcctgtt cctgtccatc cgcacccact 28680 
atcttcatgt tgttgcagat gaagcgcgca agaccgtctg aagatacctt caaccccgtg 28740 
tatccatatg acacggaaac cggtcctcca actgtgcctt ttcttactcc tccctttgta 28800 
tcccccaatg ggtttcaaga gagtccccct ggggtactct ctttgcgcct atccgaacct 288 60 
ctagttacct ccaatggcat gcttgcgctc aaaatgggca acggcctctc tctggacgag 28 920 
gccggcaacc ttacctccca aaatgtaacc actgtgagcc cacctctcaa aaaaaccaag 28980 
tcaaacataa acctggaaat atctgcaccc ctcacagtta cctcagaagc cctaactgtg 2904 0 
gctgccgccg cacctctaat ggtcgcgggc aacacactca ccatgcaatc acaggccccg 29100 
ctaaccgtgc acgactccaa acttagcatt gccacccaag gacccctcac agtgtcagaa 29160 
ggaaagctag ccctgcaaac atcaggcccc ctcaccacca ccgatagcag tacccttact 29220 
atcactgcct caccccctct aactactgcc actggtagct tgggcattga cttgaaagag 29280 
cccatttata cacaaaatgg aaaactagga ctaaagtacg gggctccttt gcatgtaaca 2934 0 
gacgacctaa acactttgac cgtagcaact ggtccaggtg tgactattaa taatacttcc 29400 
ttgcaaacta aagttactgg agccttgggt tttgattcac aaggcaatat gcaacttaat 29460 
gtagcaggag gactaaggat tgattctcaa aacagacgcc ttatacttga tgttagttat 29520 
ccgtttgatg ctcaaaacca actaaatcta agactaggac agggccctct ttttataaac 29580 
tcagcccaca acttggatat taactacaac aaaggccttt acttgtttac agcttcaaac 29640 
aattccaaaa agcttgaggt taacctaagc actgccaagg ggttgatgtt tgacgctaca 29700 
gccatagcca ttaatgcagg agatgggctt gaatttggtt cacctaatgc accaaacaca 29760 
aatcccctca aaacaaaaat tggccatggc ctagaatttg attcaaacaa ggctatggtt 29820 
cctaaactag gaactggcct tagttttgac agcacaggtg ccattacagt aggaaacaaa 29880 
aataatgata agctaacttt gtggaccaca ccagctccat ctcctaactg tagactaaat 29940 
gcagagaaag atgctaaact cactttggtc ttaacaaaat gtggcagtca aatacttgct 30000 
acagtttcag ttttggctgt taaaggcagt ttggctccaa tatctggaac agttcaaagt 30060 
gctcatctta ttataagatt tgacgaaaat ggagtgctac taaacaattc cttcctggac 30120 
ccagaatatt ggaactttag aaatggagat cttactgaag gcacagccta tacaaacgct 30180 
gttggattta tgcctaacct atcagcttat ccaaaatctc acggtaaaac tgccaaaagt 30240 
aacattgtca gtcaagttta cttaaacgga gacaaaacta aacctgtaac actaaccatt 30300 
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acactaaacg gtacacagga aacaggagac acaactccaa gtgcatactc tatgtcattt 30360 
tcatgggact ggtctggcca caactacatt aatgaaatat ttgccacatc ctcttacact 30420 
ttttcataca ttgcccaaga ataaagaatc gtttgtgtta tgtttcaacg tgtttatttt 30480 
tcaattgcag aaaatttcaa gtcatttttc attcagtagt atagccccac caccacatag 30540 
cttatacaga tcaccgtacc ttaatcaaac tcacagaacc ctagtattca acctgccacc 30600 
tccctcccaa cacacagagt acacagtcct ttctccccgg ctggccttaa aaagcatcat 30660 
atcatgggta acagacatat tcttaggtgt tatattccac acggtttcct gtcgagccaa 30720 
acgctcatca gtgatattaa taaactcccc gggcagctca cttaagttca tgtcgctgtc 30780 
cagctgctga gccacaggct gctgtccaac ttgcggttgc ttaacgggcg gcgaaggaga 30840 
agtccacgcc tacatggggg tagagtcata atcgtgcatc aggatagggc ggtggtgctg 30900 
cagcagcgcg cgaataaact gctgccgccg ccgctccgtc ctgcaggaat acaacatggc 30960 
agtggtctcc tcagcgatga ttcgcaccgc ccgcagcata aggcgccttg tcctccgggc 31020 
acagcagcgc accctgatct cacttaaatc agcacagtaa ctgcagcaca gcaccacaat 31080 
attgttcaaa atcccacagt gcaaggcgct gtatccaaag ctcatggcgg ggaccacaga 31140 
acccacgtgg ccatcatacc acaagcgcag gtagattaag tggcgacccc tcataaacac 31200 
gctggacata aacattacct cttttggcat gttgtaattc accacctccc ggtaccatat 31260 
aaacctctga ttaaacatgg cgccatccac caccatccta aaccagctgg ccaaaacctg 31320 
cccgccggct atacactgca gggaaccggg actggaacaa tgacagtgga gagcccagga 31380 
ctcgtaacca tggatcatca tgctcgtcat gatatcaatg ttggcacaac acaggcacac 31440 
gtgcatacac ttcctcagga ttacaagctc ctcccgcgtt agaaccatat cccagggaac 31500 
aacccattcc tgaatcagcg taaatcccac actgcaggga agacctcgca cgtaactcac 31560 
gttgtgcatt gtcaaagtgt tacattcggg cagcagcgga tgatcctcca gtatggtagc 31620 
gcgggtttct gtctcaaaag gaggtagacg atccctactg tacggagtgc gccgagacaa 31680 
ccgagatcgt gttggtcgta gtgtcatgcc aaatggaacg ccggacgtag tcatatttcc 31740 
tgaagcaaaa ccaggtgcgg gcgtgacaaa cagatctgcg tctccggtct cgccgcttag 31800 
atcgctctgt gtagtagttg tagtatatcc actctctcaa agcatccagg cgccccctgg 31860 
cttcgggttc tatgtaaact ccttcatgcg ccgctgccct gataacatcc accaccgcag 31920 
aataagccac acccagccaa cctacacatt cgttctgcga gtcacacacg ggaggagcgg 31980 
gaagagctgg aagaaccatg tttttttttt tattccaaaa gattatccaa aacctcaaaa 32040 
tgaagatcta ttaagtgaac gcgctcccct ccggtggcgt ggtcaaactc tacagccaaa 32100 
gaacagataa tggcatttgt aagatgttgc acaatggctt ccaaaaggca aacggccctc 32160 
acgtccaagt ggacgtaaag gctaaaccct tcagggtgaa tctcctctat aaacattcca 32220 
gcaccttcaa ccatgcccaa ataattctca tctcgccacc ttctcaatat atctctaagc 32280 
aaatcccgaa tattaagtcc ggccattgta aaaatctgct ccagagcgcc ctccaccttc 32340 
agcctcaagc agcgaatcat gattgcaaaa attcaggttc ctcacagacc tgtataagat 32400 
tcaaaagcgg aacattaaca aaaataccgc gatcccgtag gtcccttcgc agggccagct 32460 
gaacataatc gtgcaggtct gcacggacca gcgcggccac ttccccgcca ggaaccttga 32520 
caaaagaacc cacactgatt atgacacgca tactcggagc tatgctaacc agcgtagccc 32580 
cgatgtaagc tttgttgcat . gggcggcgat ataaaatgca aggtgctgct caaaaaatca 32640 
ggcaaagcct cgcgcaaaaa agaaagcaca tcgtagtcat gctcatgcag ataaaggcag 32700 
gtaagctccg gaaccaccac agaaaaagac accatttttc tctcaaacat gtctgcgggt 327 60 
ttctgcataa acacaaaata aaataacaaa aaaacattta aacattagaa gcctgtctta 32820 
caacaggaaa aacaaccctt ataagcataa gacggactac ggccatgccg gcgtgaccgt 32880 
aaaaaaactg gtcaccgtga ttaaaaagca ccaccgacag ctcctcggtc atgtccggag 32940 
tcataatgta agactcggta- aacacatcag gttgattcat cggtcagtgc taaaaagcga 33000 
ccgaaatagc ccgggggaat acatacccgc aggcgtagag acaacattac agcccccata 33060 
ggaggtataa caaaattaat aggagagaaa aacacataaa cacctgaaaa accctcctgc 33120 
ctaggcaaaa tagcaccctc ccgctccaga acaacataca gcgcttcaca gcggcagcct 33180 
aacagtcagc cttaccagta aaaaagaaaa cctattaaaa aaacaccact cgacacggca 33240 
ccagctcaat cagtcacagt gtaaaaaagg gccaagtgcg ttacactgca gcaggtgtga 33300 
ctcagccatg gcacctctgc agcctgggta ccctgcttgg ggcatggccc cttatagctg 33360 
ggcggggcgt gggggctctg taggagtggc agcgacctca gtgtttgtct ttgctctgaa 33420 
gagccctcca ggtgcttgat cccacctttt cccagcagga acactcctgc ctgccttacc 33480 
acctgtcctg gctgatggcc tgttcctgcc tcctttgccc cctgcccaga ctcccatgtt 3354 0 
cctggacttg tggcttcctc caaccagggg ctctcaagcc tccatacctg gtcccacctc 33600 
tccaggccgt gggagggagg ttgaggaggg tggagggcat ctggttgggg gcagcctggg 33660 
tgttcccctc ccatcccctc cctgggcctc ccaggccccc tctactcttg agcaatgctc 33720 
ttgagagctt cctgcctggc tcttaaccca gggcaagccc tggaagggca gacccaggac 33780 
actctcacca cctccttacc ttttcccctg gaaaaatctt ctgtatactt cccattttaa 33840 
gaaaactaca attcccaaca catacaagtt actccgccct aaaacctacg tcacccgccc 33900 
cgttcccacg ccccgcgcca cgtcacaaac tccaccccct cattatcata ttggcttcaa 33960 
tccaaaataa ggtatattat tgatgatg 33988 

<210> 15 
<211> 34737 
<212> DNA 
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<213> Adenovirus subgroup C 
<4O0> 15 

catcatcaat aatatacctt attttggatt 
ttgtgacgtg gcgcggggcg tgggaacggg 
gatgttgcaa gtgtggcgga acacatgtaa 
gtgtgcgccg gtgtacacag gaagtgacaa 
taaatttggg cgtaaccgag taagatttgg 
agtgaaatct gaataatttt gtgttactca 
gactttgacc gtttacgtgg agactcgccc 
cgggtcaaag ttggcgtttt attattatag 
tgagttcctc aagaggccac tcttgagtgc 
tccgacaccg ggactgaaaa tgagacatga 
ccattttgaa ccacctaccc ttcacgaact 
tcccaacgag gaggcggttt cgcagatttt 
agggattgac ttactcactt ttccgccggc 
ccggcagccc gagcagccgg agcagagagc 
tccacccagt gacgacgagg atgaagaggg 
ccccgggcac ggttgcaggt cttgtcatta 
tatgtgttcg ctttgctata tgaggacctg 
atgggcagtg ggtgatagag tggtgggttt 
gttttgtggt ttaaagaatt ttgtattgtg 
gagcctgagc ccgagccaga accggagcct 
cctgctatcc tgagacgccc gacatcacct 
agctgtgact ccggtccttc taacacacct 
cccattaaac cagttgccgt gagagttggt 
gacttgctta acgagcctgg gcaacctttg 
ggtgtaaacc tgtgattgcg tgtgtggtta 
agtttaataa agggtgagat aatgtttaac 
aaagggtata taatgcgccg tgggctaatc 
gagtgtttgg aagatttttc tgctgtgcgt 
tcttggtttt ggaggtttct gtggggctca 
gaggattaca agtgggaatt tgaagagctt 
ttgaatctgg gtcaccaggc gcttttccaa 
acaccggggc gcgctgcggc tgctgttgct 
gaagaaaccc atctgagcgg ggggtacctg 
gcggttgtga gacacaagaa tcgcctgcta 
ccgacggagg agcagcagca gcagcaggag 
ccatggaacc cgagagccgg cctggaccct 
tgtatccaga actgagacgc attttgacaa 
taaagaggga gcggggggct tgtgaggcta 
taatgaccag acaccgtcct gagtgtatta 
atgagcttga tctgctggcg cagaagtatt 
agccagggga tgattttgag gaggctatta 
attgcaagta caagatcagc aaacttgtaa 
acggggccga ggtggagata gatacggagg 
atatgtggcc gggggtgctt ggcatggacg 
gccccaattt tagcggtacg gttttcctgg 
gcttctatgg gtttaacaat acctgtgtgg 
gtgcctttta ctgctgctgg aagggggtgg 
agaaatgcct ctttgaaagg tgtaccttgg 
gccacaatgt ggcctccgac tgtggttgct 
agcataacat ggtatgtggc aactgcgagg 
acggcaactg tcacctgctg aagaccattc 
cagtgtttga gcataacata ctgacccgct 
tgttcctacc ttaccaatgc aatttgagtc 
tgtccaaggt gaacctgaac ggggtgtttg 
ggtacgatga gacccgcacc aggtgcagac 
accagcctgt gatgctggat gtgaccgagg 
gcacccgcgc tgagtttggc tctagcgatg 
ggcgtggctt aagggtggga aagaatatat 
gttttgcagc agccgccgcc gccatgagca 
catatttgac aacgcgcatg cccccatggg 
gcattgatgg tcgccccgtc ctgcccgcaa 
ctggaacgcc gttggagact gcagcctccg 
gcgggattgt gactgacttt gctttcctga 



gaagccaata tgataatgag ggggtggagt 60 
gcgggtgacg tagtagtgtg gcggaagtgt 120 
gcgacggatg tggcaaaagt gacgtttttg 180 
ttttcgcgcg gttttaggcg gatgttgtag 240 
ccattttcgc gggaaaactg aataagagga 300 
tagcgcgtaa tatttgtcta gggccgcggg 360 
aggtgttttt ctcaggtgtt ttccgcgttc 420 
tcagctgacg tgtagtgtat ttatacccgg 480 
cagcgagtag agttttctcc tccgagccgc 540 
ggtactggct gataatcttc. cacctcctag 600 
gtatgattta gacgtgacgg cccccgaaga 660 
tcccgactct gtaatgttgg cggtgcagga 720 
gcccggttct ccggagccgc ctcacctttc 780 
cttgggtccg gtttgccacg aggctggctt 840 
tgaggagttt gtgttagatt atgtggagca 900 
tcaccggagg aatacggggg acccagatat 960 
tggcatgttt gtctacagta agtgaaaatt 1020 
ggtgtggtaa tttttttttt aatttttaca 1080 
atttttttaa aaggtcctgt gtctgaacct 1140 
gcaagaccta cccgccgtcc taaaatggcg 1200 
gtgtctagag aatgcaatag tagtacggat 1260 
cctgagatac acccggtggt cccgctgtgc 1320 
gggcgtcgcc aggctgtgga atgtatcgag 1380 
gacttgagct gtaaacgccc caggccataa 1440 
acgcctttgt ttgctgaatg agttgatgta 1500 
ttgcatggcg tgttaaatgg ggcggggctt 1560 
ttggttacat ctgacctcat ggaggcttgg 1620 
aacttgctgg aacagagctc taacagtacc 1680 
tcccaggcaa agttagtctg cagaattaag 1740 
ttgaaatcct gtggtgagct gtttgattct 1800 
gagaaggtca tcaagacttt ggatttttcc 1860 
tttttgagtt ttataaagga taaatggagc 1920 
ctggattttc tggccatgca tctgtggaga 1980 
ctgttgtctt ccgtccgccc ggcgataata 2040 
gaagccaggc ggcggcggca ggagcagagc 2100 
cgggaatgaa tgttgtacag gtggctgaac 2160 
ttacagagga tgggcagggg ctaaaggggg 2220 
cagaggaggc taggaatcta gcttttagct 2280 
cttttcaaca gatcaaggat aattgcgcta 2340 
ccatagagca gctgaccact tactggctgc 2400 
gggtatatgc aaaggtggca cttaggccag 24 60 
atatcaggaa ttgttgctac atttctggga 2520 
atagggtggc ctttagatgt agcatgataa 2580 
gggtggttat tatgaatgta aggtttactg 2640 
ccaataccaa ccttatccta cacggtgtaa 2700 
aagcctggac cgatgtaagg gttcggggct 2760 
tgtgtcgccc caaaagcagg gcttcaatta 2820 
gtatcctgtc tgagggtaac tccagggtgc 2880 
tcatgctagt gaaaagcgtg gctgtgatta 294 0 
acagggcctc tcagatgctg acctgctcgg 3000 
acgtagccag ccactctcgc aaggcctggc 3060 
gttccttgca tttgggtaac aggagggggg 3120 
acactaagat attgcttgag cccgagagca 3180 
acatgaccat gaagatctgg aaggtgctga 3240 
cctgcgagtg tggcggtaaa catattagga 3300 
agctgaggcc cgatcacttg gtgctggcct 3360 
aagatacaga ttgaggtact gaaatgtgtg 3420 
aaggtggggg tcttatgtag ttttgtatct 3480 
ccaactcgtt tgatggaagc attgtgagct 3540 
ccggggtgcg tcagaatgtg atgggctcca 3600 
actctactac cttgacctac gagaccgtgt 3660 
ccgccgcttc agccgctgca gccaccgccc 3720 
gcccgcttgc aagcagtgca gcttcccgtt 3780 
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catccgcccg cgatgacaag ttgacggctc ttttggcaca attggattct ttgacccggg 3840 
aacttaatgt cgtttctcag cagctgttgg atctgcgcca gcaggtttct gccctgaagg 3900 
cttcctcccc tcccaatgcg gtttaaaaca taaataaaaa accagactct gtttggattt 3960 
ggatcaagca agtgtcttgc tgtctttatt taggggtttt gcgcgcgcgg taggcccggg 4020 
accagcggtc tcggtcgttg agggtcctgt gtattttttc caggacgtgg taaaggtgac 4080 
tctggatgtt cagatacatg ggcataagcc cgtctctggg gtggaggtag caccactgca 4140 
gagcttcatg ctgcggggtg gtgttgtaga tgatccagtc gtagcaggag cgctgggcgt 4200 
ggtgcctaaa aatgtctttc agtagcaagc tgattgccag gggcaggccc ttggtgtaag 4260 
tgtttacaaa gcggttaagc tgggatgggt gcatacgtgg ggatatgaga tgcatcttgg 4320 
actgtatttt taggttggct atgttcccag ccatatccct ccggggattc atgttgtgca 4380 
gaaccaccag cacagtgtat ccggtgcact tgggaaattt gtcatgtagc ttagaaggaa 4440 
atgcgtggaa gaacttggag acgcccttgt gacctccaag attttccatg cattcgtcca 4500 
taatgatggc aatgggccca cgggcggcgg cctgggcgaa gatatttctg ggatcactaa 4560 
cgtcatagtt gtgttccagg atgagatcgt cataggccat ttttacaaag cgcgggcgga 4620 
gggtgccaga ctgcggtata atggttccat ccggcccagg ggcgtagtta ccctcacaga 4680 
tttgcatttc ccacgctttg agttcagatg gggggatcat gtctacctgc ggggcgatga 4740 
agaaaacggt ttccggggta ggggagatca gctgggaaga aagcaggttc ctgagcagct 4800 
gcgacttacc gcagccggtg ggcccgtaaa tcacacctat taccgggtgc aactggtagt 4860 
taagagagct gcagctgccg tcatccctga gcaggggggc cacttcgtta agcatgtccc 4 920 
tgactcgcat gttttccctg accaaatccg ccagaaggcg ctcgccgccc agcgatagca 4 980 
gttcttgcaa ggaagcaaag tttttcaacg gtttgagacc gtccgccgta ggcatgcttt 504 0 
tgagcgtttg accaagcagt tccaggcggt cccacagctc ggtcacctgc tctacggcat 5100 
ctcgatccag catatctcct cgtttcgcgg gttggggcgg ctttcgctgt acggcagtag 5160 
tcggtgctcg tccagacggg ccagggtcat gtctttccac gggcgcaggg tcctcgtcag 5220 
cgtagtctgg gtcacggtga aggggtgcgc tccgggctgc gcgctggcca gggtgcgctt 5280 
gaggctggtc ctgctggtgc tgaagcgctg ccggtcttcg ccctgcgcgt cggccaggta 5340 
gcatttgacc atggtgtcat agtccagccc ctccgcggcg tggcccttgg cgcgcagctt 5400 
gcccttggag gaggcgccgc acgaggggca gtgcagactt ttgagggcgt agagcttggg 5460 
cgcgagaaat accgattccg gggagtaggc atccgcgccg caggccccgc agacggtctc 5520 
gcattccacg agccaggtga gctctggccg ttcggggtca aaaaccaggt ttcccccatg 5580 
ctttttgatg cgtttcttac ctctggtttc catgagccgg tgtccacgct cggtgacgaa 564 0 
aaggctgtcc gtgtccccgt atacagactt gagaggcctg tcctcgagcg gtgttccgcg 5700 
gtcctcctcg tatagaaact cggaccactc tgagacaaag gctcgcgtcc aggccagcac 5760 
gaaggaggct aagtgggagg ggtagcggtc gttgtccact agggggtcca ctcgctccag 5820 
ggtgtgaaga cacatgtcgc cctcttcggc atcaaggaag gtgattggtt tgtaggtgta 5880 
ggccacgtga ccgggtgttc ctgaaggggg gctataaaag ggggtggggg cgcgttcgtc 594 0 
ctcactctct tccgcatcgc tgtctgcgag ggccagctgt tggggtgagt actccctctg 6000 
aaaagcgggc atgacttctg cgctaagatt gtcagtttcc aaaaacgagg aggatttgat 6060 
attcacctgg cccgcggtga tgcctttgag ggtggccgca tccatctggt cagaaaagac 6120 
aatctttttg ttgtcaagct tggtggcaaa cgacccgtag agggcgttgg acagcaactt 6180 
ggcgatggag cgcagggttt ggtttttgtc gcgatcggcg cgctccttgg ccgcgatgtt 6240 
tagctgcacg tattcgcgcg caacgcaccg ccattcggga aagacggtgg tgcgctcgtc 6300 
gggcaccagg tgcacgcgcc aaccgcggtt gtgcagggtg acaaggtcaa cgctggtggc 6360 
tacctctccg cgtaggcgct cgttggtcca gcagaggcgg ccgcccttgc gcgagcagaa 6420 
tggcggtagg gggtctagct gcgtctcgtc cggggggtct gcgtccacgg taaagacccc 64B0 
gggcagcagg cgcgcgtcga agtagtctat cttgcatcct tgcaagtcta gcgcctgctg 6540 
ccatgcgcgg gcggcaagcg cgcgctcgta tgggttgagt gggggacccc atggcatggg 6600 
gtgggtgagc gcggaggcgt acatgccgca aatgtcgtaa acgtagaggg gctctctgag 6660 
tattccaaga tatgtagggt agcatcttcc accgcggatg ctggcgcgca cgtaatcgta 6720 
tagttcgtgc gagggagcga ggaggtcggg accgaggttg ctacgggcgg gctgctctgc 6780 
tcggaagact atctgcctga agatggcatg tgagttggat gatatggttg gacgctggaa 6840 
gacgttgaag ctggcgtctg tgagacctac cgcgtcacgc acgaaggagg cgtaggagtc 6900 
gcgcagcttg ttgaccagct cggcggtgac ctgcacgtct agggcgcagt agtccagggt 6960 
ttccttgatg atgtcatact tatcctgtcc cttttttttc cacagctcgc ggttgaggac 7020 
aaactcttcg cggtctttcc agtactcttg gatcggaaac ccgtcggcct ccgaacggta 7080 
agagcctagc atgtagaact ggttgacggc ctggtaggcg cagcatccct tttctacggg 7140 
tagcgcgtat gcctgcgcgg ccttccggag cgaggtgtgg gtgagcgcaa aggtgtccct 7200 
gaccatgact ttgaggtact ggtatttgaa gtcagtgtcg tcgcatccgc cctgctccca 7260 
gagcaaaaag tccgtgcgct ttttggaacg cggatttggc agggcgaagg tgacatcgtt 7320 
gaagagtatc tttcccgcgc gaggcataaa gttgcgtgtg atgcggaagg gtcccggcac 7380 
ctcggaacgg ttgttaatta cctgggcggc gagcacgatc tcgtcaaagc cgttgatgtt 7440 
gtggcccaca atgtaaagtt ccaagaagcg cgggatgccc ttgatggaag gcaatttttt 7500 
aagttcctcg taggtgagct cttcagggga gctgagcccg tgctctgaaa gggcccagtc 7560 
tgcaagatga gggttggaag cgacgaatga gctccacagg tcacgggcca ttagcatttg 7620 
caggtggtcg cgaaaggtcc taaactggcg acctatggcc attttttctg gggtgatgca 7680 
gtagaaggta agcgggtctt gttcccagcg gtcccatcca aggttcgcgg ctaggtctcg 7740 
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cgcggcagtc actagaggct catctccgcc gaacttcatg accagcatga agggcacgag 7800 
ctgcttccca aaggccccca tccaagtata ggtctctaca tcgtaggtga caaagagacg 7860 
ctcggtgcga ggatgcgagc cgatcgggaa gaactggatc tcccgccacc aattggagga 7 920 
gtggctattg atgtggtgaa agtagaagtc cctgcgacgg' gccgaacact cgtgctggct 7980 
tttgtaaaaa cgtgcgcagt actggcagcg gtgcacgggc tgtacatcct gcacgaggtt 8040 
gacctgacga ccgcgcacaa ggaagcagag tgggaatttg agcccctcgc ctggcgggtt 8100 
tggctggtgg tcttctactt cggctgcttg tccttgaccg tctggctgct cgaggggagt 8160 
tacggtggat cggaccacca cgccgcgcga gcccaaagtc cagatgtccg cgcgcggcgg 8220 
tcggagcttg atgacaacat cgcgcagatg ggagctgtcc atggtctgga gctcccgcgg 8280 
cgtcaggtca ggcgggagct cctgcaggtt tacctcgcat agacgggtca gggcgcgggc 8340 
tagatccagg tgatacctaa tttccagggg ctggttggtg gcggcgtcga tggcttgcaa 8400 
gaggccgcat ccccgcggcg cgactacggt accgcgcggc gggcggtggg ccgcgggggt 84 60 
gtccttggat gatgcatcta aaagcggtga cgcgggcgag cccccggagg tagggggggc 8520 
tccggacccg ccgggagagg gggcaggggc acgtcggcgc cgcgcgcggg caggagctgg 8580 
tgctgcgcgc gtaggttgct ggcgaacgcg acgacgcggc ggttgatctc ctgaatctgg 8640 
cgcctctgcg tgaagacgac gggcccggtg agcttgagcc tgaaagagag ttcgacagaa 8700 
tcaatttcgg tgtcgttgac ggcggcctgg cgcaaaatct cctgcacgtc tcctgagttg 8760 
tcttgatagg cgatctcggc catgaactgc tcgatctctt cctcctggag atctccgcgt 8820 
ccggctcgct ccacggtggc ggcgaggtcg ttggaaatgc gggccatgag ctgcgagaag 8880 
gcgttgaggc ctccctcgtt ccagacgcgg ctgtagacca cgcccccttc ggcatcgcgg 8940 
gcgcgcatga ccacctgcgc gagattgagc tccacgtgcc gggcgaagac ggcgtagttt 9000 
cgcaggcgct gaaagaggta gttgagggtg gtggcggtgt gttctgccac gaagaagtac 9060 
ataacccagc gtcgcaacgt ggattcgttg atatccccca aggcctcaag gcgctccatg 9120 
gcctcgtaga agtccacggc gaagttgaaa aactgggagt tgcgcgccga cacggttaac 9180 
tcctcctcca gaagacggat gagctcggcg acagtgtcgc gcacctcgcg ctcaaaggct 9240 
acaggggcct cttcttcttc ttcaatctcc tcttccataa gggcctcccc ttcttcttct 9300 
tctggcggcg gtgggggagg ggggacacgg cggcgacgac ggcgcaccgg gaggcggtcg 9360 
acaaagcgct cgatcatctc cccgcggcga cggcgcatgg tctcggtgac ggcgcggccg 9420 
ttctcgcggg ggcgcagttg gaagacgccg cccgtcatgt cccggttatg ggttggcggg 9480 
gggctgccat gcggcaggga tacggcgcta acgatgcatc tcaacaattg ttgtgtaggt 9540 
actccgccgc cgagggacct gagcgagtcc gcatcgaccg gatcggaaaa cctctcgaga 9600 
aaggcgtcta accagtcaca gtcgcaaggt aggctgagca ccgtggcggg cggcagcggg 9660 
cggcggtcgg ggttgtttct ggcggaggtg ctgctgatga tgtaattaaa gtaggcggtc 9720 
ttgagacggc ggatggtcga cagaagcacc atgtccttgg gtccggcctg ctgaatgcgc 9780 
aggcggtcgg ccatgcccca ggcttcgttt tgacatcggc gcaggtcttt gtagtagtct 9840 
tgcatgagcc tttctaccgg cacttcttct tctccttcct cttgtcctgc atctcttgca 9900 
tctatcgctg cggcggcggc ggagtttggc cgtaggtggc gccctcttcc tcccatgcgt 9960 
gtgaccccga agcccctcat cggctgaagc agggctaggt cggcgacaac gcgctcggct 10020 
aatatggcct gctgcacctg cgtgagggta gactggaagt catccatgtc cacaaagcgg 10080 
tggtatgcgc ccgtgttgat ggtgtaagtg cagttggcca taacggacca gttaacggtc 10140 
tggtgacccg gctgcgagag ctcggtgtac ctgagacgcg agtaagccct cgagtcaaat 10200 
acgtagtcgt tgcaagtccg caccaggtac tggtatccca ccaaaaagtg cggcggcggc 10260 
tggcggtaga ggggccagcg tagggtggcc ggggctccgg gggcgagatc ttccaacata 10320 
aggcgatgat atccgtagat gtacctggac atccaggtga tgccggcggc ggtggtggag 10380 
gcgcgcggaa agtcgcggac gcggttccag atgttgcgca gcggcaaaaa gtgctccatg 10440 
gtcgggacgc tctggccggt caggcgcgcg caatcgttga cgctctagcg tgcaaaagga 10500 
gagcctgtaa gcgggcactc ttccgtggtc tggtggataa attcgcaagg gtatcatggc 10560 
ggacgaccgg ggttcgagcc ccgtatccgg ccgtccgccg tgatccatgc ggttaccgcc 10620 
cgcgtgtcga acccaggtgt gcgacgtcag acaacggggg agtgctcctt ttggcttcct 10680 
tccaggcgcg gcggctgctg cgctagcttt tttggccact ggccgcgcgc agcgtaagcg 10740 
gttaggctgg aaagcgaaag cattaagtgg ctcgctccct gtagccggag ggttattttc 10800 
caagggttga gtcgcgggac ccccggttcg agtctcggac cggccggact gcggcgaacg 10860 
ggggtttgcc tccccgtcat gcaagacccc gcttgcaaat tcctccggaa acagggacga 10920 
gccccttttt tgcttttccc agatgcatcc ggtgctgcgg cagatgcgcc cccctcctca 10980 
gcagcggcaa gagcaagagc agcggcagac atgcagggca ccctcccctc ctcctaccgc 11040 
gtcaggaggg gcgacatccg cggttgacgc ggcagcagat ggtgattacg aacccccgcg 11100 
gcgccgggcc cggcactacc tggacttgga ggagggcgag ggcctggcgc ggctaggagc 11160 
gccctctcct gagcggtacc caagggtgca gctgaagcgt gatacgcgtg aggcgtacgt 11220 
gccgcggcag aacctgtttc gcgaccgcga gggagaggag cccgaggaga tgcgggatcg 11280 
aaagttccac gcagggcgcg agctgcggca tggcctgaat cgcgagcggt tgctgcgcga 11340 
ggaggacttt gagcccgacg cgcgaaccgg gattagtccc gcgcgcgcac acgtggcggc 11400 
cgccgacctg gtaaccgcat acgagcagac ggtgaaccag gagattaact ttcaaaaaag 11460 
ctttaacaac cacgtgcgta cgcttgtggc gcgcgaggag gtggctatag gactgatgca 11520 
tctgtgggac tttgtaagcg cgctggagca aaacccaaat agcaagccgc tcatggcgca 11580 
gctgttcctt atagtgcagc acagcaggga caacgaggca ttcagggatg cgctgctaaa 11640 
catagtagag cccgagggcc gctggctgct cgatttgata aacatcctgc agagcatagt 11700 
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ggtgcaggag cgcagcttga gcctggctga caaggtggcc gccatcaact attccatgct 11760 
tagcctgggc aagttttacg cccgcaagat ataccatacc ccttacgttc ccatagacaa 11820 
ggaggtaaag atcgaggggt tctacatgcg catggcgctg aaggtgctta ccttgagcga 11880 
cgacctgggc gtttatcgca acgagcgcat ccacaaggcc gtgagcgtga gccggcggcg 11940 
cgagctcagc gaccgcgagc tgatgcacag cctgcaaagg gccctggctg gcacgggcag 12000 
cggcgataga gaggccgagt cctactttga cgcgggcgct gacctgcgct gggccccaag 12060 
ccgacgcgcc ctggaggcag ctggggccgg acctgggctg gcggtggcac ccgcgcgcgc 12120 
tggcaacgtc ggcggcgtgg aggaatatga cgaggacgat gagtacgagc cagaggacgg 12180 
cgagtactaa gcggtgatgt ttctgatcag atgatgcaag acgcaacgga cccggcggtg 12240 
cgggcggcgc tgcagagcca gccgtccggc cttaactcca cggacgactg gcgccaggtc 12300 
atggaccgca tcatgtcgct gactgcgcgc aatcctgacg cgttccggca gcagccgcag 12360 
gccaaccggc tctccgcaat tctggaagcg gtggtcccgg cgcgcgcaaa ccccacgcac 12420 
gagaaggtgc tggcgatcgt aaacgcgctg gccgaaaaca gggccatccg gcccgacgag 12480 
gccggcctgg tctacgacgc gctgcttcag cgcgtggctc gttacaacag cggcaacgtg 12540 
cagaccaacc tggaccggct ggtgggggat gtgcgcgagg ccgtggcgca gcgtgagcgc 12600 
gcgcagcagc agggcaacct gggctccatg gttgcactaa acgccttcct gagtacacag 12660 
cccgccaacg tgccgcgggg acaggaggac tacaccaact ttgtgagcgc actgcggcta 12720 
atggtgactg agacaccgca aagtgaggtg taccagtctg ggccagacta ttttttccag 12780 
accagtagac aaggcctgca gaccgtaaac ctgagccagg ctttcaaaaa cttgcagggg 12840 
ctgtgggggg tgcgggctcc cacaggcgac cg.cgcgaccg tgtctagctt gctgacgccc 12900 
aactcgcgcc tgttgctgct gctaatagcg cccttcacgg acagtggcag cgtgtcccgg 12960 
gacacatacc taggtcactt gctgacactg taccgcgagg ccataggtca ggcgcatgtg 13020 
gacgagcata ctttccagga gattacaagt gtcagccgcg cgctggggca ggaggacacg 13080 
ggcagcctgg aggcaaccct aaactacctg ctgaccaacc ggcggcagaa gatcccctcg 13140 
ttgcacagtt taaacagcga ggaggagcgc attttgcgct acgtgcagca gagcgtgagc 13200 
cttaacctga tgcgcgacgg ggtaacgccc agcgtggcgc tggacatgac cgcgcgcaac 13260 
atggaaccgg gcatgtatgc ctcaaaccgg ccgtttatca accgcctaat ggactacttg 13320 
catcgcgcgg ccgccgtgaa ccccgagtat ttcaccaatg ccatcttgaa cccgcactgg 13380 
ctaccgcccc ctggtttcta caccggggga ttcgaggtgc ccgagggtaa cgatggattc 13440 
ctctgggacg acatagacga cagcgtgttt tccccgcaac cgcagaccct gctagagttg 13500 
caacagcgcg agcaggcaga ggcggcgctg cgaaaggaaa gcttccgcag gccaagcagc 13560 
ttgtccgatc taggcgctgc ggccccgcgg tcagatgcta gtagcccatt tccaagcttg 13620 
atagggtctc ttaccagcac tcgcaccacc cgcccgcgcc tgctgggcga ggaggagtac 13680 
ctaaacaact cgctgctgca gccgcagcgc gaaaaaaacc tgcctccggc atttcccaac 13740 
aacgggatag agagcctagt ggacaagatg agtagatgga agacgtacgc gcaggagcac 13800 
agggacgtgc caggcccgcg cccgcccacc cgtcgtcaaa ggcacgaccg tcagcggggt 13860 
ctggtgtggg aggacgatga ctcggcagac gacagcagcg tcctggattt gggagggagt 13920 
ggcaacccgt ttgcgcacct tcgccccagg ctggggagaa tgttttaaaa aaaaaaaagc 13980 
atgatgcaaa ataaaaaact caccaaggcc atggcaccga gcgttggttt tcttgtattc 14040 
cccttagtat gcggcgcgcg gcgatgtatg aggaaggtcc tcctccctcc tacgagagtg 14100 
tggtgagcgc ggcgccagtg gcggcggcgc tgggttctcc cttcgatgct cccctggacc 14160 
cgccgtttgt gcctccgcgg tacctgcggc ctaccggggg gagaaacagc atccgttact 14220 
ctgagttggc acccctattc gacaccaccc gtgtgtacct ggtggacaac aagtcaacgg 14280 
atgtggcatc cctgaactac cagaacgacc acagcaactt tctgaccacg gtcattcaaa 14340 
acaatgacta cagcccgggg gaggcaagca cacagaccat caatcttgac gaccggtcgc 14400 
actggggcgg cgacctgaaa accatcctgc ataccaacat gccaaatgtg aacgagttca 14460 
tgtttaccaa taagtttaag gcgcgggtga tggtgtcgcg cttgcctact aaggacaatc 14520 
aggtggagct gaaatacgag tgggtggagt tcacgctgcc cgagggcaac tactccgaga 14580 
ccatgaccat agaccttatg aacaacgcga tcgtggagca ctacttgaaa gtgggcagac 14 640 
agaacggggt tctggaaagc gacatcgggg taaagtttga cacccgcaac ttcagactgg 14700 
ggtttgaccc cgtcactggt cttgtcatgc ctggggtata tacaaacgaa gccttccatc 14760 
cagacatcat tttgctgcca ggatgcgggg tggacttcac ccacagccgc ctgagcaact 14820 
tgttgggcat ccgcaagcgg caacccttcc aggagggctt taggatcacc tacgatgatc 14880 
tggagggtgg taacattccc gcactgttgg atgtggacgc ctaccaggcg agcttgaaag 14940 
atgacaccga acagggcggg ggtggcgcag gcggcagcaa cagcagtggc agcggcgcgg 15000 
aagagaactc caacgcggca gccgcggcaa tgcagccggt ggaggacatg aacgatcatg 15060 
ccattcgcgg cgacaccttt gccacacggg ctgaggagaa gcgcgctgag gccgaagcag 15120 
cggccgaagc tgccgccccc gctgcgcaac ccgaggtcga gaagcctcag aagaaaccgg 15180 
tgatcaaacc cctgacagag gacagcaaga aacgcagtta caacctaata agcaatgaca 15240 
gcaccttcac ccagtaccgc agctggtacc ttgcatacaa ctacggcgac cctcagaccg 15300 
gaatccgctc atggaccctg ctttgcactc ctgacgtaac ctgcggctcg gagcaggtct 15360 
actggtcgtt gccagacatg atgcaagacc ccgtgacctt ccgctccacg cgccagatca 15420 
gcaactttcc ggtggtgggc gccgagctgt tgcccgtgca ctccaagagc ttctacaacg 15480 
accaggccgt ctactcccaa ctcatccgcc agtttacctc tctgacccac gtgttcaatc 15540 
gctttcccga gaaccagatt tcggcgcgcc cgccagcccc caccatcacc accgtcagtg 15600 
aaaacgttcc tgctctcaca gatcacggga cgctaccgct gcgcaacagc atcggaggag 15660 
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tccagcgagt gaccattact gacgccagac gccgcacctg cccctacgtt tacaaggccc 15720 

tgggcatagt ctcgccgcgc gtcctatcga gccgcacttt ttgagcaagc atgtccatcc 157B0 

ttatatcgcc cagcaataac acaggctggg gcctgcgctt cccaagcaag atgtttggcg 15840 

gggccaagaa gcgctccgac caacacccag tgcgcgtgcg cgggcactac cgcgcgcccf 15900 

ggggcgcgca caaacgcggc cgcactgggc gcaccaccgt cgatgacgcc atcgacgcgg 15960 

tggtggagga ggcgcgcaac tacacgccca cgccgccacc agtgtccaca gtggacgcgg 16020 

ccattcagac cgtggtgcgc ggagcccggc gctatgctaa aatgaagaga cggcggaggc 16080 

gcgtagcacg tcgccaccgc cgccgacccg gcactgccgc ccaacgcgcg gcggcggccc 16140 

tgcttaaccg cgcacgtcgc accggccgac gggcggccat gcgggccgct cgaaggctgg 16200 

ccgcgggtat tgtcactgtg ccccccaggt ccaggcgacg agcggccgcc gcagcagccg 16260 

cggccattag tgctatgact cagggtcgca ggggcaacgt gtattgggtg cgcgactcgg 16320 

ttagcggcct gcgcgtgccc gtgcgcaccc gccccccgcg caactagatt gcaagaaaaa 16380 

actacttaga ctcgtactgt tgtatgtatc cagcggcggc ggcgcgcaac gaagctatgt 16440 

ccaagcgcaa aatcaaagaa gagatgctcc aggtcatcgc gccggagatc tatggccccc 16500 

cgaagaagga agagcaggat tacaagcccc gaaagctaaa gcgggtcaaa aagaaaaaga 16560 

aagatgatga tgatgaactt gacgacgagg tggaactgct gcacgctacc gcgcccaggc 16620 

gacgggtaca gtggaaaggt cgacgcgtaa aacgtgtttt gcgacccggc accaccgtag 16680 

tctttacgcc cggtgagcgc tccacccgca cctacaagcg cgtgtatgat gaggtgtacg 16740 

gcgacgagga cctgcttgag caggccaacg agcgcctcgg ggagtttgcc tacggaaagc 16800 

ggcataagga catgctggcg ttgccgctgg acgagggcaa cccaacacct agcctaaagc 16860 

ccgtaacact gcagcaggtg ctgcccgcgc ttgcaccgtc cgaagaaaag cgcggcctaa 16920 

agcgcgagtc tggtgacttg gcacccaccg tgcagctgat ggtacccaag cgccagcgac 16980 

tggaagatgt cttggaaaaa atgaccgtgg aacctgggct ggagcccgag gtccgcgtgc 17040 

ggccaatcaa gcaggtggcg ccgggactgg gcgtgcagac cgtggacgtt cagataccca 17100 

ctaccagtag caccagtatt gccaccgcca cagagggcat ggagacacaa acgtccccgg 17160 

ttgcctcagc ggtggcggat gccgcggtgc aggcggtcgc tgcggccgcg tccaagacct 17220 

ctacggaggt gcaaacggac ccgtggatgt ttcgcgtttc agccccccgg cgcccgcgcg 17280 

gttcgaggaa gtacggcgcc gccagcgcgc tactgcccga atatgcccta catccttcca 1734 0 

ttgcgcctac ccccggctat cgtggctaca cctaccgccc cagaagacga gcaactaccc 17400 

gacgccgaac caccactgga acccgccgcc gccgtcgccg tcgccagccc gtgctggccc 17460 

cgatttccgt gcgcagggtg gctcgcgaag gaggcaggac cctggtgctg ccaacagcgc 17520 

gctaccaccc cagcatcgtt taaaagccgg tctttgtggt tcttgcagat atggccctca 17580 

cctgccgcct ccgtttcccg gtgccgggat tccgaggaag aatgcaccgt aggaggggca 17640 

tggccggcca cggcctgacg ggcggcatgc gtcgtgcgca ccaccggcgg cggcgcgcgt 17700 

cgcaccgtcg catgcgcggc ggtatcctgc ccctccttat tccactgatc gccgcggcga 17760 

ttggcgccgt gcccggaatt gcatccgtgg ccttgcaggc gcagagacac tgattaaaaa 17820 

caagttgcat gtggaaaaat caaaataaaa agtctggact ctcacgctcg cttggtcctg 17880 

taactatttt gtagaatgga agacatcaac tttgcgtctc tggccccgcg acacggctcg 17940 

cgcccgttca tgggaaactg gcaagatatc ggcaccagca atatgagcgg tggcgccttc 18000 

agctggggct cgctgtggag cggcattaaa aatttcggtt ccaccgttaa gaactatggc 18060 

agcaaggcct ggaacagcag cacaggccag atgctgaggg ataagttgaa agagcaaaat 18120 

ttccaacaaa aggtggtaga tggcctggcc tctggcatta gcggggtggt ggacctggcc 18180 

aaccaggcag tgcaaaataa gattaacagt aagcttgatc cccgccctcc cgtagaggag 18240 

cctccaccgg ccgtggagac agtgtctcca gaggggcgtg gcgaaaagcg tccgcgcccc 18300 

gacagggaag aaactctggt gacgcaaata gacgagcctc cctcgtacga ggaggcacta 18360 

aagcaaggcc tgcccaccac ccgtcccatc gcgcccatgg ctaccggagt gctgggccag 18420 

cacacacccg taacgctgga cctgcctccc cccgccgaca cccagcagaa acctgtgctg 18480 

ccaggcccga ccgccgttgt tgtaacccgt cctagccgcg cgtccctgcg ccgcgccgcc 18540 

agcggtccgc gatcgttgcg gcccgtagcc agtggcaact ggcaaagcac actgaacagc 18600 

atcgtgggtc tgggggtgca atccctgaag cgccgacgat gcttctgaat agctaacgtg 18660 

tcgtatgtgt gtcatgtatg cgtccatgtc gccgccagag gagctgctga gccgccgcgc 18720 

gcccgctttc caagatggct accccttcga tgatgccgca gtggtcttac atgcacatct 18780 

cgggccagga cgcctcggag tacctgagcc ccgggctggt gcagtttgcc cgcgccaccg 18840 

agacgtactt cagcctgaat aacaagttta gaaaccccac ggtggcgcct acgcacgacg 18 900 

tgaccacaga ccggtcccag cgtttgacgc tgcggttcat ccctgtggac cgtgaggata 18960 

ctgcgtactc gtacaaggcg cggttcaccc tagctgtggg tgataaccgt gtgctggaca 19020 

tggcttccac gtactttgac atccgcggcg tgctggacag gggccctact tttaagccct 19O80 

actctggcac tgcctacaac gccctggctc ccaagggtgc cccaaatcct tgcgaatggg 19140 

atgaagctgc tactgctctt gaaataaacc tagaagaaga ggacgatgac aacgaagacg 19200 

aagtagacga gcaagctgag cagcaaaaaa ctcacgtatt tgggcaggcg ccttattctg 19260 

gtataaatat tacaaaggag ggtattcaaa taggtgtcga aggtcaaaca cctaaatatg 19320 

ccgataaaac atttcaacct gaacctcaaa taggagaatc tcagtggtac gaaactgaaa 19380 

ttaatcatgc agctgggaga gtccttaaaa agactacccc aatgaaacca tgttacggtt 19440 

catatgcaaa acccacaaat gaaaatggag ggcaaggcat tcttgtaaag caacaaaatg 19500 

gaaagctaga aagtcaagtg gaaatgcaat ttttctcaac tactgaggcg accgcaggca 19560 

atggtgataa cttgactcct aaagtggtat tgtacagtga agatgtagat atagaaaccc 19620 
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cagacactca tatttcttac atgcccacta ttaaggaagg taactcacga gaactaatgg 19680 
gccaacaatc tatgcccaac aggcctaatt acattgcttt tagggacaat tttattggtc 19740 
taatgtatta caacagcacg ggtaatatgg gtgttctggc gggccaagca tcgcagttga 19800 
atgctgttgt agatttgcaa gacagaaaca cagagctttc ataccagctt ttgcttgatt 19860 
ccattggtga tagaaccagg tacttttcta tgtggaatca ggctgttgac agctatgatc 19920 
cagatgttag aattattgaa aatcatggaa ctgaagatga acttccaaat tactgctttc 19980 
cactgggagg tgtgattaat acagagactc ttaccaaggt aaaacctaaa acaggtcagg 20040 
aaaatggatg ggaaaaagat gctacagaat tttcagataa aaatgaaata agagttggaa 20100 
ataattttgc catggaaatc aatctaaatg ccaacctgtg gagaaatttc ctgtactcca 20160 
acatagcgct gtatttgccc gacaagctaa agtacagtcc ttccaacgta aaaatttctg 20220 
ataacccaaa cacctacgac tacatgaaca agcgagtggt ggctcccggg ttagtggact 20280 
gctacattaa ccttggagca cgctggtccc ttgactatat ggacaacgtc aacccattta 20340 
accaccaccg caatgctggc ctgcgctacc gctcaatgtt gctgggcaat ggtcgctatg 20400 
tgcccttcca catccaggtg cctcagaagt tctttgccat taaaaacctc cttctcctgc 20460 
cgggctcata cacctacgag tggaacttca ggaaggatgt taacatggtt ctgcagagct 20520 
ccctaggaaa tgacctaagg gttgacggag ccagcattaa gtttgatagc atttgccttt 20580 
acgccacctt cttccccatg gcccacaaca ccgcctccac gcttgaggcc atgcttagaa 20640 
acgacaccaa cgaccagtcc tttaacgact atctctccgc cgccaacatg ctctacccta 20700 
tacccgccaa cgctaccaac gtgcccatat ccatcccctc ccgcaactgg gcggctttcc 20760 
gcggctgggc cttcacgcgc cttaagacta aggaaacccc atcactgggc tcgggctacg 20820 
acccttatta cacctactct ggctctatac cctacctaga tggaaccttt tacctcaacc 20880 
acacctttaa gaaggtggcc attacctttg actcttctgt cagctggcct ggcaatgacc 20940 
gcctgcttac ccccaacgag tttgaaatta agcgctcagt tgacggggag ggttacaacg 21000 
ttgcccagtg taacatgacc aaagactggt tcctggtaca aatgctagct aactacaaca 21060 
ttggctacca gggcttctat atcccagaga gctacaagga ccgcatgtac tccttcttta 21120 
gaaacttcca gcccatgagc cgtcaggtgg tggatgatac taaatacaag gactaccaac 21180 
aggtgggcat cctacaccaa cacaacaact ctggatttgt tggctacctt gcccccacca 21240 
tgcgcgaagg acaggcctac cctgctaact tcccctatcc gcttataggc aagaccgcag 21300 
ttgacagcat tacccagaaa aagtttcttt gcgatcgcac cctttggcgc atcccattct 21360 
ccagtaactt tatgtccatg ggcgcactca cagacctggg ccaaaacctt ctctacgcca 21420 
actccgccca cgcgctagac atgacttttg aggtggatcc catggacgag cccacccttc 21480 
tttatgtttt gtttgaagtc tttgacgtgg tccgtgtgca ccggccgcac cgcggcgtca 21540 
tcgaaaccgt gtacctgcgc acgcccttct cggccggcaa cgccacaaca taaagaagca 21600 
agcaacatca acaacagctg ccgccatggg ctccagtgag caggaactga aagccattgt 21660 
caaagatctt ggttgtgggc catatttttt gggcacctat gacaagcgct ttccaggctt 21720 
tgtttctcca cacaagctcg cctgcgccat agtcaatacg gccggtcgcg agactggggg 21780 
cgtacactgg atggcctttg cctggaaccc gcactcaaaa acatgctacc tctttgagcc 21840 
ctttggcttt tctgaccagc gactcaagca ggtttaccag tttgagtacg agtcactcct 21900 
gcgccgtagc gccattgctt cttcccccga ccgctgtata acgctggaaa agtccaccca 21960 
aagcgtacag gggcccaact cggccgcctg tggactattc tgctgcatgt ttctccacgc 22020 
ctttgccaac tggccccaaa ctcccatgga tcacaacccc accatgaacc ttattaccgg 22080 
ggtacccaac tccatgctca acagtcccca ggtacagccc accctgcgtc gcaaccagga 22140 
acagctctac agcttcctgg agcgccactc gccctacttc cgcagccaca gtgcgcagat 22200 
taggagcgcc acttcttttt gtcacttgaa aaacatgtaa aaataatgta ctagagacac 22260 
tttcaataaa ggcaaatgct tttatttgta cactctcggg tgattattta cccccaccct 22320 
tgccgtctgc gccgtttaaa aatcaaaggg gttctgccgc gcatcgctat gcgccactgg 22380 
cagggacacg ttgcgatact ggtgtttagt gctccactta aactcaggca caaccatccg 22440 
cggcagctcg gtgaagtttt cactccacag gctgcgcacc atcaccaacg cgtttagcag 22500 
gtcgggcgcc gatatcttga agtcgcagtt ggggcctccg ccctgcgcgc gcgagttgcg 22560 
atacacaggg ttgcagcact ggaacactat cagcgccggg tggtgcacgc tggccagcac 22620 
gctcttgtcg gagatcagat ccgcgtccag gtcctccgcg ttgctcaggg cgaacggagt 22680 
caactttggt agctgccttc ccaaaaaggg cgcgtgccca ggctttgagt tgcactcgca 22740 
ccgtagtggc atcaaaaggt gaccgtgccc ggtctgggcg ttaggataca gcgcctgcat 22800 
aaaagccttg atctgcttaa aagccacctg agcctttgcg ccttcagaga agaacatgcc 22860 
gcaagacttg ccggaaaact gattggccgg acaggccgcg tcgtgcacgc agcaccttgc 22920 
gtcggtgttg gagatctgca ccacatttcg gccccaccgg ttcttcacga tcttggcctt 22980 
gctagactgc tccttcagcg cgcgctgccc gttttcgctc gtcacatcca tttcaatcac 23040 
gtgctcctta tttatcataa tgcttccgtg tagacactta agctcgcctt cgatctcagc 23100 
gcagcggtgc agccacaacg cgcagcccgt gggctcgtga tgcttgtagg tcacctctgc 23160 
aaacgactgc aggtacgcct gcaggaatcg ccccatcatc gtcacaaagg tcttgttgct 23220 
ggtgaaggtc agctgcaacc cgcggtgctc ctcgttcagc caggtcttgc atacggccgc 23280 
cagagcttcc acttggtcag gcagtagttt gaagttcgcc tttagatcgt tatccacgtg 23340 
gtacttgtcc atcagcgcgc gcgcagcctc catgcccttc tcccacgcag acacgatcgg 23400 
cacactcagc gggttcatca ccgtaatttc actttccgct tcgctgggct cttcctcttc 23460 
ctcttgcgtc cgcataccac gcgccactgg gtcgtcttca ttcagccgcc gcactgtgcg 23520 
cttacctcct ttgccatgct tgattagcac cggtgggttg ctgaaaccca ccatttgtag 23580 
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cgccacatct tctctttctt cctcgctgtc cacgactacc tctggtgatg gcgggcgctc 23640 
gggcttggga gaagggcgct tctttttctt cttgggcgca atggccaaat ccgccgccga 23700 
ggtcgatggc cgcgggctgg gtgtgcgcgg caccagcgcg tcttgtgatg agtcttcctc 23760 
gtcctcggac tcgatacgcc gcctcatccg cttttttggg ggcgcccggg gaggcggcgg 23820 
cgacggggac ggggacgaca cgtcctccat ggttggggga cgtcgcgccg caccgcgtcc 23880 
gcgctcgggg gtggtttcgc gctgctcctc ttcccgactg gccatttcct tctcctatag 23940 
gcagaaaaag atcatggagt cagtcgagaa gaaggacagc ctaaccgccc cctctgagtt 24000 
cgccaccacc gcctccaccg atgccgccaa cgcgcctacc accttccccg tcgaggcacc 24060 
cccgcttgag gaggaggaag tgattatcga gcaggaccca ggttttgtaa gcgaagacga 24120 
cgaggaccgc tcagtaccaa cagaggataa aaagcaagac caggacaacg cagaggcaaa 24180 
cgaggaacaa gtcgggcggg gggacgaaag gcatggcgac tacctagatg tgggagacga 24240 
cgtgctgttg aagcatctgc agcgccagtg cgccattatc tgcgacgcgt tgcaagagcg 24 300 
cagcgatgtg cccctcgcca tagoggatgt cagccttgcc tacgaacgcc acctattctc 24360 
accgcgcgta ccccccaaac gccaagaaaa cggcacatgc gagcccaacc cgcgcctcaa 24420 
cttctacccc gtatttgccg tgccagaggt gcttgccacc tatcacatct ttttccaaaa 24480 
ctgcaagata cccctatcct gccgtgccaa ccgcagccga gcggacaagc agctggcctt 24540 
gcggcagggc gctgtcatac ctgatatcgc ctcgctcaac gaagtgccaa aaatctttga 24 600 
gggtcttgga cgcgacgaga agcgcgcggc aaacgctctg caacaggaaa acagcgaaaa 24 660 
tgaaagtcac tctggagtgt tggtggaact cgagggtgac aacgcgcgcc tagccgtact 24720 
aaaacgcagc atcgaggtca cccactttgc ctacccggca cttaacctac cccccaaggt 24780 
catgagcaca gtcatgagtg agctgatcgt gcgccgtgcg cagcccctgg agagggatgc 24840 
aaatttgcaa gaacaaacag aggagggcct acccgcagtt ggcgacgagc agctagcgcg 24 900 
ctggcttcaa acgcgcgagc ctgccgactt ggaggagcga cgcaaactaa tgatggccgc 24 960 
agtgctcgtt accgtggagc ttgagtgcat gcagcggttc tttgctgacc cggagatgca 25020 
gcgcaagcta gaggaaacat tgcactacac ctttcgacag ggctacgtac gccaggcctg 25080 
caagatctcc aacgtggagc tctgcaacct ggtctcctac cttggaattt tgcacgaaaa 25140 
ccgccttggg caaaacgtgc ttcattccac gctcaagggc gaggcgcgcc gcgactacgt 25200 
ccgcgactgc gtttacttat ttctatgcta cacctggcag acggccatgg gcgtttggca 25260 
gcagtgcttg gaggagtgca acctcaagga gctgcagaaa ctgctaaagc aaaacttgaa 25320 
ggacctatgg aoggccttca acgagcgctc cgtggccgcg cacctggcgg acatcatttt 25380 
ccccgaacgc ctgcttaaaa ccctgcaaca gggtctgcca gacttcacca gtcaaagcat 25440 
gttgcagaac tttaggaact ttatcctaga gcgctcagga atcttgcccg ccacctgctg 25500 
tgcacttcct agcgactttg tgcccattaa gtaccgcgaa tgccctccgc cgctttgggg 25560 
ccactgctac cttctgcagc tagccaacta ccttgcctac cactctgaca taatggaaga 25620 
cgtgagcggt gacggtctac tggagtgtca ctgtcgctgc aacctatgca ccccgcaccg 25680 
ctccctggtt tgcaattcgc agctgcttaa cgaaagtcaa attatcggta cctttgagct 2574 0 
gcagggtccc tcgcctgacg aaaagtccgc ggctccgggg ttgaaactca ctccggggct 25800 
gtggacgtcg gcttaccttc gcaaatttgt acctgaggac taccacgccc acgagattag 25860 
gttctacgaa gaccaatccc gcccgccaaa tgcggagctt accgcctgcg tcattaccca 25920 
gggccacatt cttggccaat tgcaagccat caacaaagcc cgccaagagt ttctgctacg 25980 
aaagggacgg ggggtttact tggaccccca gtccggcgag gagctcaacc caatcccccc 2 604 0 
gccgccgcag ccctatcagc agcagccgcg ggcccttgct tcccaggatg gcacccaaaa 2 6100 
agaagctgca gctgccgccg ccacccacgg acgaggagga atactgggac agtcaggcag 26160 
aggaggtttt ggacgaggag gaggaggaca tgatggaaga ctgggagagc ctagacgagg 26220 
aagcttccga ggtcgaagag gtgtcagacg aaacaccgtc accctcggtc gcattcccct 26280 
cgccggcgcc ccagaaatcg gcaaccggtt ccagcatggc tacaacctcc gctcctcagg 2634 0 
cgccgccggc actgcccgtt cgccgaccca accgtagatg ggacaccact ggaaccaggg 26400 
ccggtaagtc caagcagccg ccgccgttag cccaagagca acaacagcgc caaggctacc 26460 
gctcatggcg cgggcacaag aacgccatag ttgcttgctt gcaagactgt gggggcaaca 26520 
tctccttcgc ccgccgcttt cttctctacc atcacggcgt ggccttcccc cgtaacatcc 26580 
tgcattacta ccgtcatctc tacagcccat actgcaccgg cggcagcggc agcggcagca 26640 
acagcagcgg ccacacagaa gcaaaggcga ccggatagca agactctgac aaagcccaag 26700 
aaatccacag cggcggcagc agcaggagga ggagcgctgc gtctggcgcc caacgaaccc 26760 
gtatcgaccc gcgagcttag aaacaggatt tttcccactc tgtatgctat atttcaacag 26820 
agcaggggcc aagaacaaga gctgaaaata aaaaacaggt ctctgcgatc cctcacccgc 26880 
agctgcctgt atcacaaaag cgaagatcag cttcggcgca cgctggaaga cgcggaggct 26940 
ctcttcagta aatactgcgc gctgactctt aaggactagt ttcgcgccct ttctcaaatt 27000 
taagcgcgaa aactacgtca tctccagcgg ccacacccgg cgccagcacc tgtcgtcagc 27060 
gccattatga gcaaggaaat tcccacgccc tacatgtgga gttaccagcc acaaatggga 27120 
cttgcggctg gagctgccca agactactca acccgaataa actacatgag cgcgggaccc 27180 
cacatgatat cccgggtcaa cggaatccgc gcccaccgaa accgaattct cttggaacag 2724 0 
gcggctatta ccaccacacc tcgtaataac cttaatcccc gtagttggcc cgctgccctg 27300 
gtgtaccagg aaagtcccgc tcccaccact gtggtacttc ccagagacgc ccaggccgaa 27360 
gttcagatga ctaactcagg ggcgcagctt gcgggcggct ttcgtcacag ggtgcggtcg 27420 
cccgggcagg gtataactca cctgacaatc agagggcgag gtattcagct caacgacgag 27480 
tcggtgagct cctcgcttgg tctccgtccg gacgggacat ttcagatcgg cggcgccggc 27540 
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cgtccttcat tcacgcctcg tcaggcaatc ctaactctgc agacctcgtc ctctgagccg 27600 
cgctctggag gcattggaac tctgcaattt attgaggagt ttgtgccatc ggtctacttt 27660 
aaccccttct cgggacctcc cggccactat ccggatcaat ttattcctaa ctttgacgcg 27720 
gtaaaggact cggcggacgg ctacgactga atgttaagtg gagaggcaga gcaactgcgc 27780 
ctgaaacacc tggtccactg tcgccgccac aagtgctttg cccgcgactc cggtgagttt 27840 
tgctactttg aattgcccga ggatcatatc gagggcccgg cgcacggcgt ccggcttacc 27900 
gcccagggag agcttgcccg tagcctgatt cgggagttta cccagcgccc cctgctagtt 27960 
gagcgggaca ggggaccctg tgttctcact gtgatttgca actgtcctaa ccttggatta 28020 
catcaagatc tttgttgcca tctctgtgct gagtataata aatacagaaa ttaaaatata 28080 
ctggggctcc tatcgccatc ctgtaaacgc caccgtcttc acccgcccaa gcaaaccaag 28140 
gcgaacctta cctggtactt ttaacatctc tccctctgtg atttacaaca gtttcaaccc 28200 
agacggagtg agtctacgag agaacctctc cgagctcagc tactccatca gaaaaaacac 28260 
caccctcctt acctgccggg aacgtacgag tgcgtcaccg gccgctgcac cacacctacc 28320 
gcctgaccgt aaaccagact ttttccggac agacctcaat aactctgttt accagaacag 28380 
gaggtgagct tagaaaaccc ttagggtatt aggccaaagg cgcagctact gtggggttta 28440 
tgaacaattc aagcaactct acgggctatt ctaattcagg tttctctaga agtcaggctt 28500 
cctggatgtc agcatctgac tttggccagc acctgtcccg cggatttgtt ccagtccaac 28560 
tacagcgacc caccctaaca gagatgacca acacaaccaa cgcggccgcc gctaccggac 28620 
ttacatctac cacaaataca ccccaagttt ctgcctttgt caataactgg gataacttgg 28680 
gcatgtggtg gttctccata gcgcttatgt ttgtatgcct tattattatg tggctcatct 28740 
gctgcctaaa gcgcaaacgc gcccgaccac ccatctatag tcccatcatt gtgctacacc 28800 
caaacaatga tggaatccat agattggacg gactgaaaca catgttcttt tctcttacag 28860 
tatgattaaa tgagatctag aaatggacgg aattattaca gagcagcgcc tgctagaaag 28920 
acgcagggca gcggccgagc aacagcgcat gaatcaagag ctccaagaca tggttaactt 28980 
gcaccagtgc aaaaggggta tcttttgtct ggtaaagcag gccaaagtca cctacgacag 29040 
taataccacc ggacaccgcc ttagctacaa gttgccaacc aagcgtcaga aattggtggt 29100 
catggtggga gaaaagccca ttaccataac tcagcactcg gtagaaaccg aaggctgcat 29160 
tcactcacct tgtcaaggac ctgaggatct ctgcaccctt attaagaccc tgtgcggtct 29220 
caaagatctt attcccttta actaataaaa aaaaataata aagcatcact tacttaaaat 29280 
cagttagcaa atttctgtcc agtttattca gcagcacctc cttgccctcc tcccagctct 29340 
ggtattgcag cttcctcctg gctgcaaact ttctccacaa tctaaatgga atgtcagttt 29400 
cctcctgttc ctgtccatcc gcacccacta tcttcatgtt gttgcagatg aagcgcgcaa 29460 
gaccgtctga agataccttc aaccccgtgt atccatatga cacggaaacc ggtcctccaa 29520 
ctgtgccttt tcttactcct ccctttgtat cccccaatgg gtttcaagag agtccccctg 29580 
gggtactctc tttgcgccta tccgaacctc tagttacctc caatggcatg cttgcgctca 29640 
aaatgggcaa cggcctctct ctggacgagg ccggcaacct tacctcccaa aatgtaacca 29700 
ctgtgagccc acctctcaaa aaaaccaagt caaacataaa cctggaaata tctgcacccc 29760 
tcacagttac ctcagaagcc ctaactgtgg ctgccgccgc acctctaatg gtcgcgggca 29820 
acacactcac catgcaatca caggccccgc taaccgtgca cgactccaaa cttagcattg 2 9880 
ccacccaagg acccctcaca gtgtcagaag gaaagctagc cctgcaaaca tcaggccccc 29940 
tcaccaccac cgatagcagt acccttacta tcactgcctc accccctcta actactgcca 30000 
ctggtagctt gggcattgac ttgaaagagc ccatttatac acaaaatgga aaactaggac 30060 
taaagtacgg ggctcctttg catgtaacag acgacctaaa cactttgacc gtagcaactg 30120 
gtccaggtgt gactattaat aatacttcct tgcaaactaa agttactgga gccttgggtt 30180 
ttgattcaca aggcaatatg caacttaatg tagcaggagg actaaggatt gattctcaaa 30240 
acagacgcct tatacttgat gttagttatc cgtttgatgc tcaaaaccaa ctaaatctaa 30300 
gactaggaca gggccctctt tttataaact cagcccacaa cttggatatt aactacaaca 30360 
aaggccttta cttgtttaca gcttcaaaca attccaaaaa gcttgaggtt aacctaagca 30420 
ctgccaaggg gttgatgttt gacgctacag ccatagccat taatgcagga gatgggcttg 30460 
aatttggttc acctaatgca ccaaacacaa atcccctcaa aacaaaaatt ggccatggcc 3054 0 
tagaatttga ttcaaacaag gctatggttc ctaaactagg aactggcctt agttttgaca 30600 
gcacaggtgc cattacagta ggaaacaaaa ataatgataa gctaactttg tggaccacac 30660 
cagctccatc tcctaactgt agactaaatg cagagaaaga tgctaaactc actttggtct 30720 
taacaaaatg tggcagtcaa atacttgcta cagtttcagt tttggctgtt aaaggcagtt 30780 
tggctccaat atctggaaca gttcaaagtg ctcatcttat tataagattt gacgaaaatg 30840 
gagtgctact aaacaattcc ttcctggacc cagaatattg gaactttaga aatggagatc 30900 
ttactgaagg cacagcctat acaaacgctg ttggatttat gcctaaccta tcagcttatc 30960 
caaaatctca cggtaaaact gccaaaagta acattgtcag tcaagtttac ttaaacggag 31020 
acaaaactaa acctgtaaca ctaaccatta cactaaacgg tacacaggaa acaggagaca 31080 
caactccaag tgcatactct atgtcatttt catgggactg gtctggccac aactacatta 31140 
atgaaatatt tgccacatcc tcttacactt tttcatacat tgcccaagaa taaagaatcg 31200 
tttgtgttat gtttcaacgt gtttattttt caattgcaga aaatttcaag tcatttttca 31260 
ttcagtagta tagccccacc accacatagc ttatacagat caccgtacct taatcaaact 31320 
cacagaaccc tagtattcaa cctgccacct ccctcccaac acacagagta cacagtcctt 31380 
tctccccggc tggccttaaa aagcatcata tcatgggtaa cagacatatt cttaggtgtt 314 40 
atattccaca cggtttcctg tcgagccaaa cgctcatcag tgatattaat aaactccccg 31500 
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ggcagctcac ttaagttcat gtcgctgtcc agctgctgag ccacaggctg ctgtccaact 31560 
tgcggttgct taacgggcgg cgaaggagaa gtccacgcct acatgggggt agagtcataa 31620 
tcgtgcatca ggatagggcg gtggtgctgc agcagcgcgc gaataaactg ctgccgccgc 31680 
cgctccgtcc tgcaggaata caacatggca gtggtctcct cagcgatgat tcgcaccgcc 31740 
cgcagcataa ggcgccttgt cctccgggca cagcagcgca ccctgatctc acttaaatca 31800 
gcacagtaac tgcagcacag caccacaata ttgttcaaaa tcccacagtg caaggcgctg 31860 
tatccaaagc tcatggcggg gaccacagaa cccacgtggc catcatacca caagcgcagg 31920 
tagattaagt ggcgacccct cataaacacg ctggacataa acattacctc ttttggcatg 31980 
ttgtaattca ccacctcccg gtaccatata aacctctgat taaacatggc gccatccacc 32040 
accatcctaa accagctggc caaaacctgc ccgccggcta tacactgcag ggaaccggga 32100 
ctggaacaat gacagtggag agcccaggac tcgtaaccat ggatcatcat gctcgtcatg 32160 
atatcaatgt tggcacaaca caggcacacg tgcatacact tcctcaggat tacaagctcc 32220 
tcccgcgtta gaaccatatc ccagggaaca acccattcct gaatcagcgt aaatcccaca 32280 
ctgcagggaa gacctcgcac gtaactcacg ttgtgcattg tcaaagtgtt acattcgggc 32340 
agcagcggat gatcctccag tatggtagcg cgggtttctg tctcaaaagg aggtagacga 32400 
tccctactgt acggagtgcg ccgagacaac cgagatcgtg ttggtcgtag tgtcatgcca 32460 
aatggaacgc cggacgtagt catatttcct gaagcaaaac caggtgcggg cgtgacaaac 32520 
agatctgcgt ctccggtctc gccgcttaga tcgctctgtg tagtagttgt agtatatcca 32580 
ctctctcaaa gcatccaggc gccccctggc ttcgggttct atgtaaactc cttcatgcgc 32640 
cgctgccctg ataacatcca ccaccgcaga ataagccaca cccagccaac ctacacattc 32700 
gttctgcgag tcacacacgg gaggagcggg aagagctgga agaaccatgt tttttttttt 32760 
attccaaaag attatccaaa acctcaaaat gaagatctat taagtgaacg cgctcccctc 32820 
cggtggcgtg gtcaaactct acagccaaag aacagataat ggcatttgta agatgttgca 32880 
caatggcttc caaaaggcaa acggccctca cgtccaagtg gacgtaaagg ctaaaccctt 32940 
cagggtgaat ctcctctata aacattccag caccttcaac catgcccaaa taattctcat 33000 
ctcgccacct tctcaatata tctctaagca aatcccgaat attaagtccg gccattgtaa 33060 
aaatctgctc cagagcgccc tccaccttca gcctcaagca gcgaatcatg attgcaaaaa 33120 
ttcaggttcc tcacagacct gtataagatt caaaagcgga acattaacaa aaataccgcg 33180 
atcccgtagg tcccttcgca gggccagctg aacataatcg tgcaggtctg cacggaccag 33240 
cgcggccact tccccgccag gaaccttgac aaaagaaccc acactgatta tgacacgcat 33300 
actcggagct atgctaacca gcgtagcccc gatgtaagct ttgttgcatg ggcggcgata 33360 
taaaatgcaa ggtgctgctc aaaaaatcag gcaaagcctc gcgcaaaaaa gaaagcacat 33420 
cgtagtcatg ctcatgcaga taaaggcagg taagctccgg aaccaccaca gaaaaagaca 33480 
ccatttttct ctcaaacatg tctgcgggtt tctgcataaa cacaaaataa aataacaaaa 33540 
aaacatttaa acattagaag cctgtcttac aacaggaaaa acaaccctta taagcataag 33600 
acggactacg gccatgccgg cgtgaccgta aaaaaactgg tcaccgtgat taaaaagcac 33660 
caccgacagc tcctcggtca tgtccggagt cataatgtaa gactcggtaa acacatcagg 33720 
ttgattcatc ggtcagtgct aaaaagcgac cgaaatagcc cgggggaata catacccgca 33780 
ggcgtagaga caacattaca gcccccatag gaggtataac aaaattaata ggagagaaaa 33840 
acacataaac acctgaaaaa ccctcctgcc taggcaaaat agcaccctcc cgctccagaa 33900 
caacatacag cgcttcacag cggcagccta acagtcagcc ttaccagtaa aaaagaaaac 33960 
ctattaaaaa aacaccactc gacacggcac cagctcaatc agtcacagtg taaaaaaggg 34020 
ccaagtgcgt tacactgcag caggtgtgac tcagccatgg cacctctgca gcctgggtac 34080 
cctgcttggg gcatggcccc ttatagctgg gcggggcgtg ggggctctgt aggagtggca 34140 
gcgacctcag tgtttgtctt tgctctgaag agccctccag gtgcttgatc ccaccttttc 34200 
ccagcaggaa cactcctgcc tgccttacca cctgtcctgg ctgatggcct gttcctgcct 34260 
cctttgcccc ctgcccagac tcccatgttc ctggacttgt ggcttcctcc aaccaggggc 34320 
tctcaagcct ccatacctgg tcccacctct ccaggccgtg ggagggaggt tgaggagggt 34380 
ggagggcatc tggttggggg cagcctgggt gttcccctcc catcccctcc ctgggcctcc 34440 
caggccccct ctactcttga gcaatgctct tgagagcttc ctgcctggct cttaacccag 34500 
ggcaagccct ggaagggcag acccaggaca ctctcaccac ctccttacct tttcccctgg 34560 
aaaaatcttc tgtatacttc ccattttaag aaaactacaa ttcccaacac atacaagtta 34620 
ctccgcccta aaacctacgt cacccgcccc gttcccacgc cccgcgccac gtcacaaact 34680 
ccaccccctc attatcatat tggcttcaat ccaaaataag gtatattatt gatgatg 34737 

<210> 16 
<211> 36114 
<212> DNA 

<213> Adenovirus subgroup C 
<400> 16 

catcatcaat aatatacctt attttggatt gaagccaata tgataatgag ggggtggagt 60 
ttgtgacgtg gcgcggggcg tgggaacggg gcgggtgacg tagtagtgtg gcggaagtgt 120 
gatgttgcaa gtgtggcgga acacatgtaa gcgacggatg tggcaaaagt gacgtttttg 180 
gtgtgcgccg gtgtacacag gaagtgacaa ttttcgcgcg gttttaggcg gatgttgtag 240 
taaatttggg cgtaaccgag taagatttgg ccattttcgc gggaaaactg aataagagga 300 
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agtgaaatct gaataatttt gtgttactca 
gactttgacc gtttacgtgg agactcgccc 
cgggtcaaag ttggcgtttt attattatag 
tgagttcctc aagaggccac tcttgagtgc 
tccgacaccg ggactgaaaa tgagacatga 
ccattttgaa ccacctaccc ttcacgaact 
tcccaacgag gaggcggttt cgcagatttt 
agggattgac ttactcactt ttccgccggc 
ccggcagccc gagcagccgg agcagagagc 
tccacccagt gacgacgagg atgaagaggg 
ccccgggcac ggttgcaggt cttgtcatta 
tatgtgttcg ctttgctata tgaggacctg 
atgggcagtg ggtgatagag tggtgggttt 
cttttgtggt ttaaagaatt ttgtattgtg 
gagcctgagc ccgagccaga accggagcct 
cctgctatcc tgagacgccc gacatcacct 
agctgtgact ccggtccttc taacacacct 
cccattaaac cagttgccgt gagagttggt 
gacttgctta acgagcctgg gcaacctttg 
ggtgtaaacc tgtgattgcg tgtgtggtta 
agtttaataa agggtgagat aatgtttaac 
aaagggtata taatgcgccg tgggctaatc 
gagtgtttgg aagatttttc tgctgtgcgt 
tcttggtttt ggaggtttct gtggggctca 
gaggattaca agtgggaatt tgaagagctt 
ttgaatctgg gtcaccaggc gcttttccaa 
acaccggggc gcgctgcggc tgctgttgct 
gaagaaaccc atctgagcgg ggggtacctg 
gcggttgtga gacacaagaa tcgcctgcta 
ccgacggagg agcagcagca gcagcaggag 
ccatggaacc cgagagccgg cctggaccct 
tgtatccaga actgagacgc attttgacaa 
taaagaggga gcggggggct tgtgaggcta 
taatgaccag acaccgtcct gagtgtatta 
atgagcttga tctgctggcg cagaagtatt 
agccagggga tgattttgag gaggctatta 
attgcaagta caagatcagc aaacttgtaa 
acggggccga ggtggagata gatacggagg 
atatgtggcc gggggtgctt ggcatggacg 
gccccaattt tagcggtacg gttttcctgg 
gcttctatgg gtttaacaat acctgtgtgg 
gtgcctttta ctgctgctgg aagggggtgg 
agaaatgcct ctttgaaagg tgtaccttgg 
gccacaatgt ggcctccgac tgtggttgct 
agcataacat ggtatgtggc aactgcgagg 
acggcaactg tcacctgctg aagaccattc 
cagtgtttga gcataacata . ctgacccgct 
tgttcctacc ttaccaatgc aatttgagtc 
tgtccaaggt gaacctgaac ggggtgtttg 
ggtacgatga gacccgcacc aggtgcagac 
accagcctgt gatgctggat gtgaccgagg 
gcacccgcgc tgagtttggc tctagcgatg 
ggcgtggctt aagggtggga aagaatatat 
gttttgcagc agccgccgcc gccatgagca 
catatttgac aacgcgcatg cccccatggg 
gcattgatgg tcgccccgtc ctgcccgcaa 
ctggaacgcc gttggagact gcagcctccg 
gcgggattgt gactgacttt gctttcctga 
catccgcccg cgatgacaag ttgacggctc 
aacttaatgt cgtttctcag cagctgttgg 
cttcctcccc tcccaatgcg gtttaaaaca 
ggatcaagca agtgtcttgc tgtctttatt 
accagcggtc tcggtcgttg agggtcctgt 
tctggatgtt cagatacatg ggcataagcc 
gagcttcatg ctgcggggtg gtgttgtaga 
ggtgcctaaa aatgtctttc agtagcaagc 



tagcgcgtaa tatttgtcta gggccgcggg 360 
aggtgttttt ctcaggtgtt ttccgcgttc 420 
tcagctgacg tgtagtgtat ttatacccgg 480 
cagcgagtag agttttctcc tccgagccgc 540 
ggtactggct gataatcttc cacctcctag 600 
gtatgattta gacgtgacgg cccccgaaga 660 
tcccgactct gtaatgttgg cggtgcagga 720 
gcccggttct ccggagccgc ctcacctttc 780 
cttgggtccg gtttgccacg aggctggctt 840 
tgaggagttt gtgttagatt atgtggagca 900 
tcaccggagg aatacggggg acccagatat 960 
tggcatgttt gtctacagta agtgaaaatt 1020 
ggtgtggtaa tttttttttt aatttttaca 1080 
atttttttaa aaggtcctgt gtctgaacct 1140 
gcaagaccta cccgccgtcc taaaatggcg 1200 
gtgtctagag aatgcaatag tagtacggat 1260 
cctgagatac acccggtggt cccgctgtgc 1320 
gggcgtcgcc aggctgtgga atgtatcgag 1380 
gacttgagct gtaaacgccc caggccataa 1440 
acgcctttgt ttgctgaatg agttgatgta 1500 
ttgcatggcg tgttaaatgg ggcggggctt 1560 
ttggttacat ctgacctcat ggaggcttgg 1620 
aacttgctgg aacagagctc taacagtacc 1680 
tcccaggcaa agttagtctg cagaattaag 1740 
ttgaaatcct gtggtgagct gtttgattct 1800 
gagaaggtca tcaagacttt ggatttttcc 1860 
tttttgagtt ttataaagga taaatggagc 1920 
ctggattttc tggccatgca tctgtggaga 1980 
ctgttgtctt ccgtccgccc ggcgataata 2040 
gaagccaggc ggcggcggca ggagcagagc 2100 
cgggaatgaa tgttgtacag gtggctgaac 2160 
ttacagagga tgggcagggg ctaaaggggg 2220 
cagaggaggc taggaatcta gcttttagct 2280 
cttttcaaca gatcaaggat aattgcgcta 2340 
ccatagagca gctgaccact tactggctgc 2400 
gggtatatgc aaaggtggca cttaggccag 24 60 
atatcaggaa ttgttgctac atttctggga 2520 
atagggtggc ctttagatgt agcatgataa 2580 
gggtggttat tatgaatgta aggtttactg 2640 
ccaataccaa ccttatccta cacggtgtaa 2700 
aagcctggac cgatgtaagg gttcggggct 27 60 
tgtgtcgccc caaaagcagg gcttcaatta 2820 
gtatcctgtc tgagggtaac tccagggtgc 2880 
tcatgctagt gaaaagcgtg gctgtgatta 2940 
acagggcctc tcagatgctg acctgctcgg 3000 
acgtagccag ccactctcgc aaggcctggc 3060 
gttccttgca tttgggtaac aggagggggg 3120 
acactaagat attgcttgag cccgagagca 3180 
acatgaccat gaagatctgg aaggtgctga 324 0 
cctgcgagtg tggcggtaaa catattagga 3300 
agctgaggcc cgatcacttg gtgctggcct 3360 
aagatacaga ttgaggtact gaaatgtgtg 3420 
aaggtggggg tcttatgtag ttttgtatct 3480 
ccaactcgtt tgatggaagc attgtgagct 3540 
ccggggtgcg tcagaatgtg atgggctcca 3600 
actctactac cttgacctac gagaccgtgt 3660 
ccgccgcttc agccgctgca gccaccgccc 3720 
gcccgcttgc aagcagtgca gcttcccgtt 3780 
ttttggcaca attggattct ttgacccggg 3840 
atctgcgcca gcaggtttct gccctgaagg 3900 
taaataaaaa accagactct gtttggattt 3960 
taggggtttt gcgcgcgcgg taggcccggg 4020 
gtattttttc caggacgtgg taaaggtgac 4080 
cgtctctggg gtggaggtag caccactgca 4140 
tgatccagtc gtagcaggag cgctgggcgt 4200 
tgattgccag gggcaggccc ttggtgtaag 4260 
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tgtttacaaa gcggttaagc tgggatgggt gcatacgtgg ggatatgaga tgcatcttgg 4320 
actgtatttt taggttggct atgttcccag ccatatccct ccggggattc atgttgtgca 4380 
gaaccaccag cacagtgtat ccggtgcact tgggaaattt gtcatgtagc ttagaaggaa 4440 
atgcgtggaa gaacttggag acgcccttgt gacctccaag attttccatg cattcgtcca 4500 
taatgatggc aatgggccca cgggcggcgg cctgggcgaa gatatttctg ggatcactaa 4560 
cgtcatagtt gtgttccagg atgagatcgt cataggccat ttttacaaag cgcgggcgga 4 620 
gggtgccaga ctgcggtata atggttccat ccggcccagg ggcgtagtta ccctcacaga 4 680 
tttgcatttc ccacgctttg agttcagatg gggggatcat gtctacctgc ggggcgatga 4740 
agaaaacggt ttccggggta ggggagatca gctgggaaga aagcaggttc ctgagcagct 4800 
gcgacttacc gcagccggtg ggcccgtaaa tcacacctat taccgggtgc aactggtagt 4860 
taagagagct gcagctgccg tcatccctga gcaggggggc cacttcgtta agcatgtccc 4920 
tgactcgcat gttttccctg accaaatccg ccagaaggcg ctcgccgccc agcgatagca 4 980 
gttcttgcaa ggaagcaaag tttttcaacg gtttgagacc gtccgccgta ggcatgcttt 5040 
tgagcgtttg accaagcagt tccaggcggt cccacagctc ggtcacctgc tctacggcat 5100 
ctcgatccag catatctcct cgtttcgcgg gttggggcgg ctttcgctgt acggcagtag 5160 
tcggtgctcg tccagacggg ccagggtcat gtctttccac gggcgcaggg tcctcgtcag 5220 
cgtagtctgg gtcacggtga aggggtgcgc tccgggctgc gcgctggcca gggtgcgctt 5280 
gaggctggtc ctgctggtgc tgaagcgctg ccggtcttcg ccctgcgcgt cggccaggta 5340 
gcatttgacc atggtgtcat agtccagccc ctccgcggcg tggcccttgg cgcgcagctt 5400 
gcccttggag gaggcgccgc acgaggggca gtgcagactt' ttgagggcgt agagcttggg 5460 
cgcgagaaat accgattccg gggagtaggc atccgcgccg caggccccgc agacggtctc 5520 
gcattccacg agccaggtga gctctggccg ttcggggtca aaaaccaggt ttcccccatg 5580 
ctttttgatg cgtttcttac ctctggtttc catgagccgg tgtccacgct cggtgacgaa 5640 
aaggctgtcc gtgtccccgt atacagactt gagaggcctg tcctcgagcg gtgttccgcg 5700 
gtcctcctcg tatagaaact cggaccactc tgagacaaag gctcgcgtcc aggccagcac 5760 
gaaggaggct aagtgggagg ggtagcggtc gttgtccact agggggtcca ctcgctccag 5820 
ggtgtgaaga cacatgtcgc cctcttcggc atcaaggaag gtgattggtt tgtaggtgta 5880 
ggccacgtga ccgggtgttc ctgaaggggg gctataaaag ggggtggggg cgcgttcgtc 594 0 
ctcactctct tccgcatcgc tgtctgcgag ggccagctgt tggggtgagt actccctctg 6000 
aaaagcgggc atgacttctg cgctaagatt gtcagtttcc aaaaacgagg aggatttgat 6060 
attcacctgg cccgcggtga tgcctttgag ggtggccgca tccatctggt cagaaaagac 6120 
aatctttttg ttgtcaagct tggtggcaaa cgacccgtag agggcgttgg acagcaactt 6180 
ggcgatggag cgcagggttt ggtttttgtc gcgatcggcg cgctccttgg ccgcgatgtt 624 0 
tagctgcacg tattcgcgcg caacgcaccg ccattcggga aagacggtgg tgcgctcgtc 6300 
gggcaccagg tgcacgcgcc aaccgcggtt gtgcagggtg acaaggtcaa cgctggtggc 6360 
tacctctccg cgtaggcgct cgttggtcca gcagaggcgg ccgcccttgc gcgagcagaa 6420 
tggcggtagg gggtctagct gcgtctcgtc cggggggtct gcgtccacgg taaagacccc 6480 
gggcagcagg cgcgcgtcga agtagtctat cttgcatcct tgcaagtcta gcgcctgctg 654 0 
ccatgcgcgg gcggcaagcg cgcgctcgta tgggttgagt gggggacccc atggcatggg 6600 
gtgggtgagc gcggaggcgt acatgccgca aatgtcgtaa acgtagaggg gctctctgag 6660 
tattccaaga tatgtagggt agcatcttcc accgcggatg ctggcgcgca cgtaatcgta 6720 
tagttcgtgc gagggagcga ggaggtcggg accgaggttg ctacgggcgg gctgctctgc 6780 
tcggaagact atctgcctga agatggcatg tgagttggat gatatggttg gacgctggaa 6840 
gacgttgaag ctggcgtctg tgagacctac cgcgtcacgc acgaaggagg cgtaggagtc 6900 
gcgcagcttg ttgaccagct cggcggtgac ctgcacgtct agggcgcagt agtccagggt 6960 
ttccttgatg atgtcatact tatcctgtcc cttttttttc cacagctcgc ggttgaggac 7020 
aaactcttcg cggtctttcc agtactcttg gatcggaaac ccgtcggcct ccgaacggta 7080 
agagcctagc atgtagaact ggttgacggc ctggtaggcg cagcatccct tttctacggg 7140 
tagcgcgtat gcctgcgcgg ccttccggag cgaggtgtgg gtgagcgcaa aggtgtccct 7200 
gaccatgact ttgaggtact ggtatttgaa gtcagtgtcg tcgcatccgc cctgctccca 7260 
gagcaaaaag tccgtgcgct ttttggaacg cggatttggc agggcgaagg tgacatcgtt 7320 
gaagagtatc tttcccgcgc gaggcataaa gttgcgtgtg atgcggaagg gtcccggcac 7380 
ctcggaacgg ttgttaatta cctgggcggc gagcacgatc tcgtcaaagc cgttgatgtt 7440 
gtggcccaca atgtaaagtt ccaagaagcg cgggatgccc ttgatggaag gcaatttttt 7500 
aagttcctcg taggtgagct cttcagggga gctgagcccg tgctctgaaa gggcccagtc 7560 
tgcaagatga gggttggaag cgacgaatga gctccacagg tcacgggcca ttagcatttg 7620 
caggtggtcg cgaaaggtcc taaactggcg acctatggcc attttttctg gggtgatgca 7680 
gtagaaggta agcgggtctt gttcccagcg gtcccatcca aggttcgcgg ctaggtctcg 7740 
cgcggcagtc actagaggct catctccgcc gaacttcatg accagcatga agggcacgag 7800 
ctgcttccca aaggccccca tccaagtata ggtctctaca tcgtaggtga caaagagacg 7860 
ctcggtgcga ggatgcgagc cgatcgggaa gaactggatc tcccgccacc aattggagga 7920 
gtggctattg atgtggtgaa agtagaagtc cctgcgacgg gccgaacact cgtgctggct 7980 
tttgtaaaaa cgtgcgcagt actggcagcg gtgcacgggc tgtacatcct gcacgaggtt 8040 
gacctgacga ccgcgcacaa ggaagcagag tgggaatttg agcccctcgc ctggcgggtt 8100 
tggctggtgg tcttctactt cggctgcttg tccttgaccg tctggctgct cgaggggagt 8160 
tacggtggat cggaccacca cgccgcgcga gcccaaagtc cagatgtccg cgcgcggcgg 8220 
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tcggagcttg atgacaacat cgcgcagatg ggagctgtcc atggtctgga gctcccgcgg 8280 
cgtcaggtca ggcgggagct cctgcaggtt tacctcgcat agacgggtca gggcgcgggc 8340 
tagatccagg tgatacctaa tttccagggg ctggttggtg gcggcgtcga tggcttgcaa 8400 
gaggccgcat ccccgcggcg cgactacggt accgcgcggc gggcggtggg ccgcgggggt 84 60 
gtccttggat gatgcatcta aaagcggtga cgcgggcgag cccccggagg tagggggggc 8520 
tccggacccg ccgggagagg gggcaggggc acgtcggcgc cgcgcgcggg caggagctgg 8580 
tgctgcgcgc gtaggttgct ggcgaacgcg acgacgcggc ggttgatctc ctgaatctgg 8 640 
cgcctctgcg tgaagacgac gggcccggtg agcttgagcc tgaaagagag ttcgacagaa 8700 
tcaatttcgg tgtcgttgac ggcggcctgg cgcaaaatct cctgcacgtc tcctgagttg 8760 
tcttgatagg cgatctcggc catgaactgc tcgatctctt cctcctggag atctccgcgt 8820 
ccggctcgct ccacggtggc ggcgaggtcg ttggaaatgc gggccatgag ctgcgagaag 8880 
gcgttgaggc ctccctcgtt ccagacgcgg ctgtagacca cgcccccttc ggcatcgcgg 8940 
gcgcgcatga ccacctgcgc gagattgagc tccacgtgcc gggcgaagac ggcgtagttt 9000 
cgcaggcgct gaaagaggta gttgagggtg gtggcggtgt gttctgccac gaagaagtac 9060 
ataacccagc gtcgcaacgt ggattcgttg atatccccca aggcctcaag gcgctccatg 9120 
gcctcgtaga agtccacggc gaagttgaaa aactgggagt tgcgcgccga cacggttaac 9180 
tcctcctcca gaagacggat gagctcggcg acagtgtcgc gcacctcgcg ctcaaaggct 9240 
acaggggcct cttcttcttc ttcaatctcc tcttccataa gggcctcccc ttcttcttct 9300 
tctggcggcg gtgggggagg ggggacacgg cggcgacgac ggcgcaccgg gaggcggtcg 9360 
acaaagcgct cgatcatctc cccgcggcga cggcgcatgg tctcggtgac ggcgcggccg 9420 
ttctcgcggg ggcgcagttg gaagacgccg cccgtcatgt cccggttatg ggttggcggg 9480 
gggctgccat gcggcaggga tacggcgcta acgatgcatc tcaacaattg ttgtgtaggt 9540 
actccgccgc cgagggacct gagcgagtcc gcatcgaccg gatcggaaaa cctctcgaga 9600 
aaggcgtcta accagtcaca gtcgcaaggt aggctgagca ccgtggcggg cggcagcggg 9660 
cggcggtcgg ggttgtttct ggcggaggtg ctgctgatga tgtaattaaa gtaggcggtc 9720 
ttgagacggc ggatggtcga cagaagcacc atgtccttgg gtccggcctg ctgaatgcgc 9780 
aggcggtcgg ccatgcccca ggcttcgttt tgacatcggc gcaggtcttt gtagtagtct 9840 
tgcatgagcc tttctaccgg cacttcttct tctccttcct cttgtcctgc atctcttgca 9900 
tctatcgctg cggcggcggc ggagtttggc cgtaggtggc gccctcttcc tcccatgcgt 9960 
gtgaccccga agcccctcat cggctgaagc agggctaggt cggcgacaac gcgctcggct 10020 
aatatggcct gctgcacctg cgtgagggta gactggaagt catccatgtc cacaaagcgg 10080 
tggtatgcgc ccgtgttgat ggtgtaagtg cagttggcca taacggacca gttaacggtc 10140 
tggtgacccg gctgcgagag ctcggtgtac ctgagacgcg agtaagccct cgagtcaaat 10200 
acgtagtcgt tgcaagtccg caccaggtac tggtatccca ccaaaaagtg cggcggcggc 10260 
tggcggtaga ggggccagcg tagggtggcc ggggctccgg gggcgagatc ttccaacata 10320 
aggcgatgat atccgtagat gtacctggac atccaggtga tgccggcggc ggtggtggag 10380 
gcgcgcggaa agtcgcggac gcggttccag atgttgcgca gcggcaaaaa gtgctccatg 10440 
gtcgggacgc tctggccggt caggcgcgcg caatcgttga cgctctagcg tgcaaaagga 10500 
gagcctgtaa gcgggcactc ttccgtggtc tggtggataa attcgcaagg gtatcatggc 10560 
ggacgaccgg ggttcgagcc ccgtatccgg ccgtccgccg tgatccatgc ggttaccgcc 10620 
cgcgtgtcga acccaggtgt gcgacgtcag acaacggggg agtgctcctt ttggcttcct 10680 
tccaggcgcg gcggctgctg cgctagcttt tttggccact ggccgcgcgc agcgtaagcg 10740 
gttaggctgg aaagcgaaag cattaagtgg ctcgctccct gtagccggag ggttattttc 10800 
caagggttga gtcgcgggac ccccggttcg agtctcggac cggccggact gcggcgaacg 10860 
ggggtttgcc tccccgtcat gcaagacccc gcttgcaaat tcctccggaa acagggacga 10920 
gccccttttt tgcttttccc agatgcatcc ggtgctgcgg cagatgcgcc cccctcctca 10980 
gcagcggcaa gagcaagagc agcggcagac atgcagggca ccctcccctc ctcctaccgc 11040 
gtcaggaggg gcgacatccg cggttgacgc ggcagcagat ggtgattacg aacccccgcg 11100 
gcgccgggcc cggcactacc tggacttgga ggagggcgag ggcctggcgc ggctaggagc 11160 
gccctctcct gagcggtacc caagggtgca gctgaagcgt gatacgcgtg aggcgtacgt 11220 
gccgcggcag aacctgtttc gcgaccgcga gggagaggag cccgaggaga tgcgggatcg 11280 
aaagttccac gcagggcgcg agctgcggca tggcctgaat cgcgagcggt tgctgcgcga 11340 
ggaggacttt gagcccgacg cgcgaaccgg gattagtccc gcgcgcgcac acgtggcggc 11400 
cgccgacctg gtaaccgcat acgagcagac ggtgaaccag gagattaact ttcaaaaaag 11460 
ctttaacaac cacgtgcgta cgcttgtggc gcgcgaggag gtggctatag gactgatgca 11520 
tctgtgggac tttgtaagcg cgctggagca aaacccaaat agcaagccgc tcatggcgca 11580 
gctgttcctt atagtgcagc acagcaggga caacgaggca ttcagggatg cgctgctaaa 11640 
catagtagag cccgagggcc gctggctgct cgatttgata aacatcctgc agagcatagt 11700 
ggtgcaggag cgcagcttga gcctggctga caaggtggcc gccatcaact attccatgct 11760 
tagcctgggc aagttttacg cccgcaagat ataccatacc ccttacgttc ccatagacaa 11820 
ggaggtaaag atcgaggggt tctacatgcg catggcgctg aaggtgctta ccttgagcga 11880 
cgacctgggc gtttatcgca acgagcgcat ccacaaggcc gtgagcgtga gccggcggcg 11940 
cgagctcagc gaccgcgagc tgatgcacag cctgcaaagg gccctggctg gcacgggcag 12000 
cggcgataga gaggccgagt cctactttga cgcgggcgct gacctgcgct gggccccaag 12060 
ccgacgcgcc ctggaggcag ctggggccgg acctgggctg gcggtggcac ccgcgcgcgc 12120 
tggcaacgtc ggcggcgtgg aggaatatga cgaggacgat gagtacgagc cagaggacgg 12180 
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cgagtactaa gcggtgatgt ttctgatcag atgatgcaag acgcaacgga cccggcggtg 12240 
cgggcggcgc tgcagagcca gccgtccggc cttaactcca cggacgactg gcgccaggtc 12300 
atggaccgca tcatgtcgct gactgcgcgc aatcctgacg cgttccggca gcagccgcag 12360 
gccaaccgg'c tctccgcaat tctggaagcg gtggtcccgg cgcgcgcaaa ccccacgcac 12420 
gagaaggtgc tggcgatcgt aaacgcgctg gccgaaaaca gggccatccg gcccgacgag 12480 
gccggcctgg tctacgacgc gctgcttcag cgcgtggctc gttacaacag cggcaacgtg 12540 
cagaccaacc tggaccggct ggtgggggat gtgcgcgagg ccgtggcgca gcgtgagcgc 12 600 
gcgcagcagc agggcaacct gggctccatg gttgcactaa acgccttcct gagtacacag 12660 
cccgccaacg tgccgcgggg acaggaggac tacaccaact ttgtgagcgc actgcggcta 12720 
atggtgactg agacaccgca aagtgaggtg taccagtctg ggccagacta ttttttccag 12780 
accagtagac aaggcctgca gaccgtaaac ctgagccagg ctttcaaaaa cttgcagggg 12840 
ctgtgggggg tgcgggctcc cacaggcgac cgcgcgaccg tgtctagctt gctgacgccc 12900 
aactcgcgcc tgttgctgct gctaatagcg cccttcacgg acagtggcag cgtgtcccgg 12960 
gacacatacc taggtcactt gctgacactg taccgcgagg ccataggtca ggcgcatgtg 13020 
gacgagcata ctttccagga gattacaagt gtcagccgcg cgctggggca ggaggacacg 13080 
ggcagcctgg aggcaaccct aaactacctg ctgaccaacc ggcggcagaa gatcccctcg 13140 
ttgcacagtt taaacagcga ggaggagcgc attttgcgct acgtgcagca gagcgtgagc 13200 
cttaacctga tgcgcgacgg ggtaacgccc agcgtggcgc tggacatgac cgcgcgcaac 13260 
atggaaccgg gcatgtatgc ctcaaaccgg ccgtttatca accgcctaat ggactacttg 13320 
catcgcgcgg ccgccgtgaa ccccgagtat ttcaccaatg ccatcttgaa cccgcactgg 13380 
ctaccgcccc ctggtttcta caccggggga ttcgaggtgc ccgagggtaa cgatggattc 13440 
ctctgggacg acatagacga cagcgtgttt tccccgcaac cgcagaccct gctagagttg 13500 
caacagcgcg agcaggcaga ggcggcgctg cgaaaggaaa gcttccgcag gccaagcagc 13560 
ttgtccgatc taggcgctgc ggccccgcgg tcagatgcta gtagcccatt tccaagcttg 13620 
atagggtctc ttaccagcac tcgcaccacc cgcccgcgcc tgctgggcga ggaggagtac 13680 
ctaaacaact cgctgctgca gccgcagcgc gaaaaaaacc tgcctccggc atttcccaac 137 4 0 
aacgggatag agagcctagt ggacaagatg agtagatgga agacgtacgc gcaggagcac 13800 
agggacgtgc caggcccgcg cccgcccacc cgtcgtcaaa ggcacgaccg tcagcggggt 13860 
ctggtgtggg aggacgatga ctcggcagac gacagcagcg tcctggattt gggagggagt 13920 
ggcaacccgt ttgcgcacct tcgccccagg ctggggagaa tgttttaaaa aaaaaaaagc 13980 
atgatgcaaa ataaaaaact caccaaggcc atggcaccga gcgttggttt tcttgtattc 14040 
cccttagtat gcggcgcgcg gcgatgtatg aggaaggtcc tcctccctcc tacgagagtg 14100 
tggtgagcgc ggcgccagtg gcggcggcgc tgggttctcc cttcgatgct cccctggacc 14160 
cgccgtttgt gcctccgcgg tacctgcggc ctaccggggg gagaaacagc atccgttact 14220 
ctgagttggc acccctattc gacaccaccc gtgtgtacct ggtggacaac aagtcaacgg 14280 
atgtggcatc cctgaactac cagaacgacc acagcaactt tctgaccacg gtcattcaaa 14340 
acaatgacta cagcccgggg gaggcaagca cacagaccat caatcttgac gaccggtcgc 14400 
actggggcgg cgacctgaaa accatcctgc ataccaacat gccaaatgtg aacgagttca 14460 
tgtttaccaa taagtttaag gcgcgggtga tggtgtcgcg cttgcctact aaggacaatc 14520 
aggtggagct gaaatacgag tgggtggagt tcacgctgcc cgagggcaac tactccgaga 14580 
ccatgaccat agaccttatg aacaacgcga tcgtggagca ctacttgaaa gtgggcagac 14640 
agaacggggt tctggaaagc gacatcgggg taaagtttga cacccgcaac ttcagactgg 14700 
ggtttgaccc cgtcactggt cttgtcatgc ctggggtata tacaaacgaa gccttccatc 147 60 
cagacatcat tttgctgcca ggatgcgggg tggacttcac ccacagccgc ctgagcaact 14820 
tgttgggcat ccgcaagcgg caacccttcc aggagggctt taggatcacc tacgatgatc 14880 
tggagggtgg taacattccc gcactgttgg atgtggacgc ctaccaggcg agcttgaaag 14940 
atgacaccga acagggcggg ggtggcgcag gcggcagcaa cagcagtggc agcggcgcgg 15000 
aagagaactc caacgcggca gccgcggcaa tgcagccggt ggaggacatg aacgatcatg 15060 
ccattcgcgg cgacaccttt • gccacacggg ctgaggagaa gcgcgctgag gccgaagcag 15120 
cggccgaagc tgccgccccc gctgcgcaac ccgaggtcga gaagcctcag aagaaaccgg 15180 
tgatcaaacc cctgacagag gacagcaaga aacgcagtta caacctaata agcaatgaca 15240 
gcaccttcac ccagtaccgc agctggtacc ttgcatacaa ctacggcgac cctcagaccg 15300 
gaatccgctc atggaccctg ctttgcactc ctgacgtaac ctgcggctcg gagcaggtct 15360 
actggtcgtt gccagacatg atgcaagacc ccgtgacctt ccgctccacg cgccagatca 15420 
gcaactttcc ggtggtgggc gccgagctgt tgcccgtgca ctccaagagc ttctacaacg 154 80 
accaggccgt ctactcccaa ctcatccgcc agtttacctc tctgacccac gtgttcaatc 15540 
gctttcccga gaaccagatt ttggcgcgcc cgccagcccc caccatcacc accgtcagtg 15600 
aaaacgttcc tgctctcaca gatcacggga cgctaccgct gcgcaacagc atcggaggag 15660 
tccagcgagt gaccattact gacgccagac gccgcacctg cccctacgtt tacaaggccc 15720 
tgggcatagt ctcgccgcgc gtcctatcga gccgcacttt ttgagcaagc atgtccatcc 15780 
ttatatcgcc cagcaataac acaggctggg gcctgcgctt cccaagcaag atgtttggcg 15840 
gggccaagaa gcgctccgac caacacccag tgcgcgtgcg cgggcactac cgcgcgccct 15900 
ggggcgcgca caaacgcggc cgcactgggc gcaccaccgt cgatgacgcc atcgacgcgg 15960 
tggtggagga ggcgcgcaac tacacgccca cgccgccacc agtgtccaca gtggacgcgg 16020 
ccattcagac cgtggtgcgc ggagcccggc gctatgctaa aatgaagaga cggcggaggc 16080 
gcgtagcacg tcgccaccgc cgccgacccg gcactgccgc ccaacgcgcg gcggcggccc 16140 
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tgcttaaccg cgcacgtcgc accggccgac gggcggccat gcgggccgct cgaaggctgc 16200 
ccgcgggtat tgtcactgtg ccccccaggt ccaggcgacg agcggccgcc gcagcagccg 16260 
cggccattag tgctatgact cagggtcgca ggggcaacgt gtattgggtg cgcgactcgg 16320 
ttagcggcct gcgcgtgccc gtgcgcaccc gccccccgcg caactagatt gcaagaaaaa 16380 
actacttaga ctcgtactgt tgtatgtatc cagcggcggc ggcgcgcaac gaagctatgt 16440 
ccaagcgcaa aatcaaagaa gagatgctcc aggtcatcgc gccggagatc tatggccccc 16500 
cgaagaagga agagcaggat tacaagcccc gaaagctaaa gcgggtcaaa aagaaaaaga 16560 
aagatgatga tgatgaactt gacgacgagg tggaactgct gcacgctacc gcgcccaggc 16620 
gacgggtaca gtggaaaggt cgacgcgtaa aacgtgtttt gcgacccggc accaccgtag 16680 
tctttacgcc cggtgagcgc tccacccgca cctacaagcg cgtgtatgat gaggtgtacg 16740 
gcgacgagga cctgcttgag caggccaacg agcgcctcgg ggagtttgcc tacggaaagc 16800 
ggcataagga catgctggcg ttgccgctgg acgagggcaa cccaacacct agcctaaagc 16860 
ccgtaacact gcagcaggtg ctgcccgcgc ttgcaccgtc cgaagaaaag cgcggcctaa 16920 
agcgcgagtc tggtgacttg gcacccaccg tgcagctgat ggtacccaag cgccagcgac 16980 
tggaagatgt cttggaaaaa atgaccgtgg aacctgggct ggagcccgag gtccgcgtgc 17040 
ggccaatcaa gcaggtggcg ccgggactgg gcgtgcagac cgtggacgtt cagataccca 17100 
ctaccagtag caccagtatt gccaccgcca cagagggcat ggagacacaa acgtccccgg 17160 
ttgcctcagc ggtggcggat gccgcggtgc aggcggtcgc tgcggccgcg tccaagacct 17220 
ctacggaggt gcaaacggac ccgtggatgt ttcgcgtttc agccccccgg cgcccgcgcg 17280 
gttcgaggaa gtacggcgcc gccagcgcgc tactgcccga atatgcccta catccttcca 17340 
ttgcgcctac ccccggctat cgtggctaca cctaccgccc cagaagacga gcaactaccc 17400 
gacgccgaac caccactgga acccgccgcc gccgtcgccg tcgccagccc gtgctggccc 17460 
cgatttccgt gcgcagggtg gctcgcgaag gaggcaggac cctggtgctg ccaacagcgc 17520 
gctaccaccc cagcatcgtt taaaagccgg tctttgtggt tcttgcagat atggccctca 17580 
cctgccgcct ccgtttcccg gtgccgggat tccgaggaag aatgcaccgt aggaggggca 17640 
tggccggcca cggcctgacg ggcggcatgc gtcgtgcgca ccaccggcgg cggcgcgcgt 17700 
cgcaccgtcg catgcgcggc ggtatcctgc ccctccttat tccactgatc gccgcggcga 17760 
ttggcgccgt gcccggaatt gcatccgtgg ccttgcaggc gcagagacac tgattaaaaa 17820 
caagttgcat gtggaaaaat caaaataaaa agtctggact ctcacgctcg cttggtcctg 17880 
taactatttt gtagaatgga agacatcaac tttgcgtctc tggccccgcg acacggctcg 17940 
cgcccgttca tgggaaactg gcaagatatc ggcaccagca atatgagcgg tggcgccttc 18000 
agctggggct cgctgtggag cggcattaaa aatttcggtt ccaccgttaa gaactatggc 18060 
agcaaggcct ggaacagcag cacaggccag atgctgaggg ataagttgaa agagcaaaat 18120 
ttccaacaaa aggtggtaga tggcctggcc tctggcatta gcggggtggt ggacctggcc 18180 
aaccaggcag tgcaaaataa gattaacagt aagcttgatc cccgccctcc cgtagaggag 18240 
cctccaccgg ccgtggagac agtgtctcca gaggggcgtg gcgaaaagcg tccgcgcccc 18300 
gacagggaag aaactctggt gacgcaaata gacgagcctc cctcgtacga ggaggcacta 18360 
aagcaaggcc tgcccaccac ccgtcccatc gcgcccatgg ctaccggagt gctgggccag 18420 
cacacacccg taacgctgga cctgcctccc cccgccgaca cccagcagaa acctgtgctg 18480 
ccaggcccga ccgccgttgt tgtaacccgt cctagccgcg cgtccctgcg ccgcgccgcc 18540 
agcggtccgc gatcgttgcg gcccgtagcc agtggcaact ggcaaagcac actgaacagc 18600 
atcgtgggtc tgggggtgca atccctgaag cgccgacgat gcttctgaat agctaacgtg 18660 
tcgtatgtgt gtcatgtatg cgtccatgtc gccgccagag gagctgctga gccgccgcgc 18720 
gcccgctttc caagatggct accccttcga tgatgccgca gtggtcttac atgcacatct 18780 
cgggccagga cgcctcggag tacctgagcc ccgggctggt gcagtttgcc cgcgccaccg 18840 
agacgtactt cagcctgaat aacaagttta gaaaccccac ggtggcgcct acgcacgacg 18900 
tgaccacaga ccggtcccag cgtttgacgc tgcggttcat ccctgtggac cgtgaggata 18960 
ctgcgtactc gtacaaggcg cggttcaccc tagctgtggg tgataaccgt gtgctggaca 19020 
tggcttccac gtactttgac atccgcggcg tgctggacag gggccctact tttaagccct 19080 
actctggcac tgcctacaac gccctggctc ccaagggtgc cccaaatcct tgcgaatggg 19140 
atgaagctgc tactgctctt gaaataaacc tagaagaaga ggacgatgac aacgaagacg 19200 
aagtagacga gcaagctgag cagcaaaaaa ctcacgtatt tgggcaggcg ccttattctg 19260 
gtataaatat tacaaaggag ggtattcaaa taggtgtcga aggtcaaaca cctaaatatg 19320 
ccgataaaac atttcaacct gaacctcaaa taggagaatc tcagtggtac gaaactgaaa 19380 
ttaatcatgc agctgggaga gtccttaaaa agactacccc aatgaaacca tgttacggtt 19440 
catatgcaaa acccacaaat gaaaatggag ggcaaggcat tcttgtaaag caacaaaatg 19500 
gaaagctaga aagtcaagtg gaaatgcaat ttttctcaac tactgaggcg accgcaggca 19560 
atggtgataa cttgactcct aaagtggtat tgtacagtga agatgtagat atagaaaccc 19620 
cagacactca tatttcttac atgcccacta ttaaggaagg taactcacga gaactaatgg 19680 
gccaacaatc tatgcccaac aggcctaatt acattgcttt tagggacaat tttattggtc 19740 
taatgtatta caacagcacg ggtaatatgg gtgttctggc gggccaagca tcgcagttga 19800 
atgctgttgt agatttgcaa gacagaaaca cagagctttc ataccagctt ttgcttgatt 19860 
ccattggtga tagaaccagg tacttttcta tgtggaatca ggctgttgac agctatgatc 19920 
cagatgttag aattattgaa aatcatggaa ctgaagatga acttccaaat tactgctttc 19980 
cactgggagg tgtgattaat acagagactc ttaccaaggt aaaacctaaa acaggtcagg 20040 
aaaatggatg ggaaaaagat gctacagaat tttcagataa aaatgaaata agagttggaa 20100 
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ataattttgc catggaaatc aatctaaatg ccaacctgtg gagaaatttc ctgtactcca 20160 
acatagcgct gtatttgccc gacaagctaa agtacagtcc ttccaacgta aaaatttctg 20220 
ataacccaaa cacctacgac tacatgaaca agcgagtggt ggctcccggg ttagtggact 20280 
gctacattaa ccttggagca cgctggtccc ttgactatat ggacaacgtc aacccattta 20340 
accaccaccg caatgctggc ctgcgctacc gctcaatgtt gctgggcaat ggtcgctatg 20400 
tgcccttcca catccaggtg cctcagaagt tctttgccat taaaaacctc cttctcctgc 20460 
cgggctcata cacctacgag tggaacttca ggaaggatgt taacatggtt ctgcagagct 20520 
ccctaggaaa tgacctaagg gttgacggag ccagcattaa gtttgatagc atttgccttt 20580 
acgccacctt cttccccatg gcccacaaca ccgcctccac gcttgaggcc atgcttagaa 20640 
acgacaccaa cgaccagtcc tttaacgact atctctccgc cgccaacatg ctctacccta 20700 
tacccgccaa cgctaccaac gtgcccatat ccatcccctc ccgcaactgg gcggctttcc 20760 
gcggctgggc cttcacgcgc cttaagacta aggaaacccc atcactgggc tcgggctacg 20820 
acccttatta cacctactct ggctctatac cctacctaga tggaaccttt tacctcaacc 20880 
acacctttaa gaaggtggcc attacctttg actcttctgt cagctggcct ggcaatgacc 2094 0 
gcctgcttac ccccaacgag tttgaaatta agcgctcagt tgacggggag ggttacaacg 21000 
ttgcccagtg taacatgacc aaagactggt tcctggtaca aatgctagct aactacaaca 21060 
ttggctacca gggcttctat atcccagaga gctacaagga ccgcatgtac tccttcttta 21120 
gaaacttcca gcccatgagc cgtcaggtgg tggatgatac taaatacaag gactaccaac 21180 
aggtgggcat cctacaccaa cacaacaact ctggatttgt tggctacctt gcccccacca 21240 
tgcgcgaagg acaggcctac cctgctaact tcccctatcc gcttataggc aagaccgcag 21300 
ttgacagcat tacccagaaa aagtttcttt gcgatcgcac cctttggcgc atcccattct 21360 
ccagtaactt tatgtccatg ggcgcactca cagacctggg ccaaaacctt ctctacgcca 21420 
actccgccca cgcgctagac atgacttttg aggtggatcc catggacgag cccacccttc 21480 
tttatgtttt gtttgaagtc tttgacgtgg tccgtgtgca ccggccgcac cgcggcgtca 21540 
tcgaaaccgt gtacctgcgc acgcccttct cggccggcaa cgccacaaca taaagaagca 21600 
agcaacatca acaacagctg ccgccatggg ctccagtgag caggaactga aagccattgt 21660 
caaagatctt ggttgtgggc catatttttt gggcacctat gacaagcgct ttccaggctt 21720 
tgtttctcca cacaagctcg cctgcgccat agtcaatacg gccggtcgcg agactggggg 21780 
cgtacactgg atggcctttg cctggaaccc gcactcaaaa acatgctacc tctttgagcc 21840 
ctttggcttt tctgaccagc gactcaagca ggtttaccag tttgagtacg agtcactcct 21900 
gcgccgtagc gccattgctt cttcccccga ccgctgtata acgctggaaa agtccaccca 21960 
aagcgtacag gggcccaact cggccgcctg tggactattc tgctgcatgt ttctccacgc 22020 
ctttgccaac tggccccaaa ctcccatgga tcacaacccc accatgaacc ttattaccgg 22080 
ggtacccaac tccatgctca acagtcccca ggtacagccc accctgcgtc gcaaccagga 22140 
acagctctac agcttcctgg agcgccactc gccctacttc cgcagccaca gtgcgcagat 22200 
taggagcgcc acttcttttt gtcacttgaa aaacatgtaa aaataatgta ctagagacac 22260 
tttcaataaa ggcaaatgct tttatttgta cactctcggg tgattattta cccccaccct 22320 
tgccgtctgc gccgtttaaa aatcaaaggg gttctgccgc gcatcgctat gcgccactgg 22380 
cagggacacg ttgcgatact ggtgtttagt gctccactta aactcaggca caaccatccg 22440 
cggcagctcg gtgaagtttt cactccacag gctgcgcacc atcaccaacg cgtttagcag 2250O 
gtcgggcgcc gatatcttga agtcgcagtt ggggcctccg ccctgcgcgc gcgagttgcg 22560 
atacacaggg ttgcagcact ggaacactat cagcgccggg tggtgcacgc tggccagcac 22620 
gctcttgtcg gagatcagat ccgcgtccag gtcctccgcg ttgctcaggg cgaacggagt 22680 
caactttggt agctgccttc ccaaaaaggg cgcgtgccca ggctttgagt tgcactcgca 22740 
ccgtagtggc atcaaaaggt gaccgtgccc ggtctgggcg ttaggataca gcgcctgcat 22800 
aaaagccttg atctgcttaa aagccacctg agcctttgcg ccttcagaga agaacatgcc 22860 
gcaagacttg ccggaaaact gattggccgg acaggccgcg tcgtgcacgc agcaccttgc 22920 
gtcggtgttg gagatctgca ccacatttcg gccccaccgg ttcttcacga tcttggcctt 22980 
gctagactgc tccttcagcg cgcgctgccc gttttcgctc gtcacatcca tttcaatcac 23040 
gtgctcctta tttatcataa tgcttccgtg tagacactta agctcgcctt cgatctcagc 23100 
gcagcggtgc agccacaacg cgcagcccgt gggctcgtga tgcttgtagg tcacctctgc 23160 
aaacgactgc aggtacgcct gcaggaatcg ccccatcatc gtcacaaagg tcttgttgct 23220 
ggtgaaggtc agctgcaacc cgcggtgctc ctcgttcagc caggtcttgc atacggccgc 23280 
cagagcttcc acttggtcag gcagtagttt gaagttcgcc tttagatcgt tatccacgtg 23340 
gtacttgtcc atcagcgcgc gcgcagcctc catgcccttc tcccacgcag acacgatcgg 234 00 
cacactcagc gggttcatca ccgtaatttc actttccgct tcgctgggct cttcctcttc 234 60 
ctcttgcgtc cgcataccac gcgccactgg gtcgtcttca ttcagccgcc gcactgtgcg 23520 
cttacctcct ttgccatgct tgattagcac cggtgggttg ctgaaaccca ccatttgtag 23580 
cgccacatct tctctttctt cctcgctgtc cacgattacc tctggtgatg gcgggcgctc 23640 
gggcttggga gaagggcgct tctttttctt cttgggcgca atggccaaat ccgccgccga 23700 
ggtcgatggc cgcgggctgg gtgtgcgcgg caccagcgcg tcttgtgatg agtcttcctc 23760 
gtcctcggac tcgatacgcc gcctcatccg cttttttggg ggcgcccggg gaggcggcgg 23820 
cgacggggac ggggacgaca cgtcctccat ggttggggga cgtcgcgccg caccgcgtcc 23880 
gcgctcgggg gtggtttcgc gctgctcctc ttcccgactg gccatttcct tctcctatag 23940 
gcagaaaaag atcatggagt cagtcgagaa gaaggacagc ctaaccgccc cctctgagtt 24000 
cgccaccacc gcctccaccg atgccgccaa cgcgcctacc accttccccg tcgaggcacc 24060 
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cccgcttgag gaggaggaag tgattatcga gcaggaccca ggttttgtaa gcgaagacga 24120 
cgaggaccgc tcagtaccaa cagaggataa aaagcaagac caggacaacg cagaggcaaa 241B0 
cgaggaacaa gtcgggcggg gggacgaaag gcatggcgac tacctagatg tgggagacga 24240 
cgtgctgttg aagcatctgc agcgccagtg cgccattatc tgcgacgcgt tgcaagagca 24300 
cagcgatgtg cccctcgcca tagcggatgt cagccttgcc tacgaacgcc acctattctc 24360 
accgcgcgta ccccccaaac gccaagaaaa cggcacatgc gagcccaacc cgcgcctcaa 24 420 
cttctacccc gtatttgccg tgccagaggt gcttgccacc tatcacatct ttttccaaaa 24480 
ctgcaagata cccctatcct gccgtgccaa ccgcagccga gcggacaagc agctggcctt 24540 
gcggcagggc gctgtcatac ctgatatcgc ctcgctcaac gaagtgccaa aaatctttga 24600 
gggtcttgga cgcgacgaga agcgcgcggc aaacgctctg caacaggaaa acagcgaaaa 24660 
tgaaagtcac tctggagtgt tggtggaact cgagggtgac aacgcgcgcc tagccgtact 24720 
aaaacgcagc atcgaggtca cccactttgc ctacccggca cttaacctac cccccaaggt 24780 
catgagcaca gtcatgagtg agctgatcgt gcgccgtgcg cagcccctgg agagggatgc 24840 
aaatttgcaa gaacaaacag aggagggcct acccgcagtt ggcgacgagc agctagcgcg 24900 
ctggcttcaa acgcgcgagc ctgccgactt ggaggagcga cgcaaactaa tgatggccgc 24 960 
agcgctcgtt accgtggagc ttgagtgcat gcagcggttc tttgctgacc cggagatgca 25020 
gcgcaagcta gaggaaacat tgcactacac ctttcgacag ggctacgtac gccaggcctg 25080 
caagatctcc aacgtggagc tctgcaacct ggtctcctac cttggaattt tgcacgaaaa 25140 
ccgccttggg caaaacgtgc ttcattccac gctcaagggc gaggcgcgcc gcgactacgt 25200 
ccgcgactgc gtttacttat ttctatgcta cacctggcag acggccatgg gcgtttggca 25260 
gcagtgctrg gaggagtgca acctcaagga gctgcagaaa ctgctaaagc aaaacttgaa 25320 
ggacctatgg acggccttca acgagcgctc cgtggccgcg cacctggcgg acatcatttt 25380 
ccccgaacgc ctgcttaaaa ccctgcaaca gggtctgcca gacttcacca gtcaaagcat 25440 
gttgcagaac tttaggaact ttatcctaga gcgctcagga atcttgcccg ccacctgctg 2550O 
tgcacttcct agcgactttg tgcccattaa gtaccgcgaa tgccctccgc cgctttgggg 25560 
ccactgctac cttctgcagc tagccaacta ccttgcctac cactctgaca taatggaaga 25620 
cgtgagcggt gacggtctac tggagtgtca ctgtcgctgc aacctatgca ccccgcaccg 25680 
ctccctggtt tgcaattcgc agctgcttaa cgaaagtcaa attatcggta cctttgagct 25740 
gcagggtccc tcgcctgacg aaaagtccgc ggctccgggg ttgaaactca ctccggggct 25800 
gtggacgtcg gcttaccttc gcaaatttgt acctgaggac taccacgccc acgagattag 25860 
gttctacgaa gaccaatccc gcccgccaaa tgcggagctt accgcctgcg tcattaccca 25920 
gggccacatt cttggccaat tgcaagccat caacaaagcc cgccaagagt ttctgctacg 25980 
aaagggacgg ggggtttact tggaccccca gtccggcgag gagctcaacc caatcccccc 26040 
gccgccgcag ccctatcagc agcagccgcg ggcccttgct tcccaggatg gcacccaaaa 26100 
agaagctgca gctgccgccg ccacccacgg acgaggagga atactgggac agtcaggcag 26160 
aggaggtttt ggacgaggag gaggaggaca tgatggaaga ctgggagagc ctagacgagg 26220 
aagcttccga ggtcgaagag gtgtcagacg aaacaccgtc accctcggtc gcattcccct 26280 
cgccggcgcc ccagaaatcg gcaaccggtt ccagcatggc tacaacctcc gctcctcagg 26340 
cgccgccggc actgcccgtt cgccgaccca accgtagatg ggacaccact ggaaccaggg 26400 
ccggtaagtc caagcagccg ccgccgttag cccaagagca acaacagcgc caaggctacc 26460 
gctcatggcg cgggcacaag aacgccatag ttgcttgctt gcaagactgt gggggcaaca 26520 
tctccttcgc ccgccgcttt cttctctacc atcacggcgt ggccttcccc cgtaacatcc 26580 
tgcattacta ccgtcatctc tacagcccat actgcaccgg cggcagcggc agcggcagca 26640 
acagcagcgg ccacacagaa gcaaaggcga ccggatagca agactctgac aaagcccaag 26700 
aaatccacag cggcggcagc agcaggagga ggagcgctgc gtctggcgcc caacgaaccc 26760 
gtatcgaccc gcgagcttag aaacaggatt tttcccactc tgtatgctat atttcaacag 26820 
agcaggggcc aagaacaaga gctgaaaata aaaaacaggt ctctgcgatc cctcacccgc 26880 
agctgcctgt atcacaaaag cgaagatcag cttcggcgca cgctggaaga cgcggaggct 26940 
ctcttcagta aatactgcgc gctgactctt aaggactagt ttcgcgccct ttctcaaatt 27000 
taagcgcgaa aactacgtca tctccagcgg ccacacccgg cgccagcacc tgtcgtcagc 27060 
gccattatga gcaaggaaat tcccacgccc tacatgtgga gttaccagcc acaaatggga 27120 
cttgcggctg gagctgccca agactactca acccgaataa actacatgag cgcgggaccc 27180 
cacatgatat cccgggtcaa cggaatccgc gcccaccgaa accgaattct cttggaacag 27240 
gcggctatta ccaccacacc tcgtaataac cttaatcccc gtagttggcc cgctgccctg 27300 
gtgtaccagg aaagtcccgc tcccaccact gtggtacttc ccagagacgc ccaggccgaa 27360 
gttcagatga ctaactcagg ggcgcagctt gcgggcggct ttcgtcacag ggtgcggtcg 27420 
cccgggcagg gtataactca cctgacaatc agagggcgag gtattcagct caacgacgag 27480 
tcggtgagct cctcgcttgg tctccgtccg gacgggacat ttcagatcgg cggcgccggc 2754 0 
cgtccttcat tcacgcctcg tcaggcaatc ctaactctgc agacctcgtc ctctgagccg 27600 
cgctctggag gcattggaac tctgcaattt attgaggagt ttgtgccatc ggtctacttt 27660 
aaccccttct cgggacctcc cggccactat ccggatcaat ttattcctaa ctttgacgcg 27720 
gtaaaggact cggcggacgg ctacgactga atgttaagtg gagaggcaga gcaactgcgc 27780 
ctgaaacacc tggtccactg tcgccgccac aagtgctttg cccgcgactc cggtgagttt 27840 
tgctactttg aattgcccga ggatcatatc gagggcccgg cgcacggcgt ccggcttacc 27900 
gcccagggag agcttgcccg tagcctgatt cgggagttta cccagcgccc cctgctagtt 27960 
gagcgggaca ggggaccctg tgttctcact gtgatttgca actgtcctaa ccttggatta 28020 
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catcaagatc tttgttgcca tctctgtgct 
ctggggctcc tatcgccatc ctgtaaacgc 
gcgaacctta cctggtactt ttaacatctc 
agacggagtg agtctacgag agaacctctc 
caccctcctt acctgccggg aacgtacgag 
gcctgaccgt aaaccagact ttttccggac 
gaggtgagct tagaaaaccc ttagggtatt 
tgaacaattc aagcaactct acgggctatt 
ttctctgtct tgtgattctc tttattctta 
cctgctgtgt gcacatttgc atttattgtc 
gatgattagg tacataatcc taggtttact 
aaaggtggat tttaaggagc cagcctgtaa 
caccactctt ataaaatgca ccacagaaca 
aattggcaag tatgctgttt atgctatttg 
tacagttttc cagggtaaaa gtcataaaac 
gtgcgacatt accatgtaca tgagcaaaca 
ggaaaacact ggcactttct gctgcactgc 
taccctactc tatattaaat acaaaagcag 
ttaatttact aagttacaaa gctaatgtca 
acaaattcaa aaagttagca ttataattag 
gctcaatacc attcccctga acaattgact 
ttgaagtcag gcttcctgga tgtcagcatc 
tgttccagtc caactacagc gacccaccct 
cgccgctacc ggacttacat ctaccacaaa 
ctgggataac ttgggcatgt ggtggttctc 
tatgtggctc atctgctgcc taaagcgcaa 
cattgtgcta cacccaaaca atgatggaat 
cttttctctt acagtatgat taaatgagac 
cttgttgcgc ttttttgtgc gtgctccaca 
tgcattccag ccttcacagt ctatttgctt 
agcctcatca ctgtggtcat cgcctttatc 
gcatatctca gctgctgcca tgttgtgttg 
tgctcttgtc gccttagatc tctctttatg 
tgtgttttgt cctatatatt ttaattttta 
gcgttctggt aggccgtcat tgaaaactga 
aataatggtt gagactcagc ccaacatcgg 
atcggcagat gaggtggatt gagactcaac 
gaggtggatt gagactcatg agggtggtat 
gctttacagt ccaacgtgca ggacgcttgg 
ttcgcaactg gcgccctgtg ctcttcacag 
tgtaaaacgg aaaccgaatg tccgacaaag 
ctgccgctgg gccgacgccg tccgggaatt 
ctcaccttgt caaggacctg aggatctctg 
agatcttatt ccctttaact aataaaaaaa 
ttagcaaatt tctgtccagt ttattcagca 
attgcagctt cctcctggct gcaaactttc 
cctgttcctg tccatccgca cccactatct 
cgtctgaaga taccttcaac cccgtgtatc 
tgccttttct tactcctccc tttgtatccc 
tactctcttt gcgcctatcc gaacctctag 
tgggcaacgg cctctctctg gacgaggccg 
tgagcccacc tctcaaaaaa accaagtcaa 
cagttacctc agaagcccta actgtggctg 
cactcaccat gcaatcacag gccccgctaa 
cccaaggacc cctcacagtg tcagaaggaa 
ccaccaccga tagcagtacc cttactatca 
gtagcttggg cattgacttg aaagagccca 
agtacggggc tcctttgcat gtaacagacg 
caggtgtgac tattaataat acttccttgc 
attcacaagg caatatgcaa cttaatgtag 
gacgccttat acttgatgtt agttatccgt 
taggacaggg ccctcttttt ataaactcag 
gcctttactt gtttacagct tcaaacaatt 
ccaaggggtt gatgtttgac gctacagcca 
ttggttcacc taatgcacca aacacaaatc 
aatttgattc aaacaaggct atggttccta 



gagtataata aatacagaaa ttaaaatata 28080 
caccgtcttc acccgcccaa gcaaaccaag 28140 
tccctctgtg atttacaaca gtttcaaccc 28200 
cgagctcagc tactccatca gaaaaaacac 28260 
tgcgtcaccg gccgctgcac cacacctacc 28320 
agacctcaat aactctgttt accagaacag 28380 
aggccaaagg cgcagctact gtggggttta 28440 
ctaattcagg tttctctagg gttggggtta 28500 
tactaacgct tctctgccta aggctcgccg 28560 
agctttttaa acgctggggt cgccacccaa 28620 
cacccttgcg tcagcccacg gtaccaccca 28680 
tgttacattc gcagctgaag ctaatgagtg 28740 
tgaaaagctg cttattcgcc acaaaaacaa 28800 
gcagccaggt gacactacag agtataatgt 28860 
ttttatgtat acttttccat tttatgaaat 28920 
gtataagttg tggcccccac aaaattgtgt 28980 
tatgctaatt acagtgctcg ctttggtctg 29040 
acgcagcttt attgaggaaa agaaaatgcc 29100 
ccactaactg ctttactcgc tgcttgcaaa 29160 
aataggattt aaaccccccg gtcatttcct 29220 
ctatgtggga tatgctccag cgctacaacc 29280 
tgactttggc cagcacctgt cccgcggatt 29340 
aacagagatg accaacacaa ccaacgcggc 29400 
tacaccccaa gtttctgcct ttgtcaataa 29460 
catagcgctt atgtttgtat gccttattat 29520 
acgcgcccga ccacccatct atagtcccat 29580 
ccatagattg gacggactga aacacatgtt 29640 
atgattcctc gagtttttat attactgacc 29700 
ttggctgcgg tttctcacat cgaagtagac 29760 
tacggatttg tcaccctcac gctcatctgc 29820 
cagtgcattg actgggtctg tgtgcgcttt 29880 
ctaccatgtt gttttcatgt gttgctgcca 29940 
tagtgttgtg gtgtctctct tgtcgtgatg 30000 
atccaaaccc ctgtccccgc agaggccttt 30060 
cttaactcgt taaattaaaa aaatgtaaaa 30120 
cagatgaggt ggattgagac tcagcccaac 30180 
cccaacattg gcagatgagg tgaattagat 30240 
gagggcccga cgtccacagg tgggagttgt 30300 
catttgccag agaacaccaa gattggcaaa 30360 
acggaaaaat gaccaaaatc tgattatttt 30420 
ttcatttgat gacttcccgg taggtctgcc 30480 
ttacaaacga tttcggacgt ctagcattca 30540 
cacccttatt aagaccctgt gcggtctcaa 30600 
aataataaag catcacttac ttaaaatcag 30660 
gcacctcctt gccctcctcc cagctctggt 30720 
tccacaatct aaatggaatg tcagtttcct 3078O 
tcatgttgtt gcagatgaag cgcgcaagac 30840 
catatgacac ggaaaccggt cctccaactg 30900 
ccaatgggtt tcaagagagt ccccctgggg 30960 
ttacctccaa tggcatgctt gcgctcaaaa 31020 
gcaaccttac ctcccaaaat gtaaccactg 31080 
acataaacct ggaaatatct gcacccctca 31140 
ccgccgcacc tctaatggtc gcgggcaaca 31200 
ccgtgcacga ctccaaactt agcattgcca 31260 
agctagccct gcaaacatca ggccccctca 31320 
ctgcctcacc ccctctaact actgccactg 31380 
tttatacaca aaatggaaaa ctaggactaa 31440 
acctaaacac tttgaccgta gcaactggtc 31500 
aaactaaagt tactggagcc ttgggttttg 31560 
caggaggact aaggattgat tctcaaaaca 31620 
ttgatgctca aaaccaacta aatctaagac 31680 
cccacaactt ggatattaac tacaacaaag 31740 
ccaaaaagct tgaggttaac ctaagcactg 31800 
tagccattaa tgcaggagat gggcttgaat 31860 
ccctcaaaac aaaaattggc catggcctag 31920 
aactaggaac tggccttagt tttgacagca 31980 
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caggtgccat tacagtagga aacaaaaata atgataagct aactttgtgg accacaccag 32040 
ctccatctcc taactgtaga ctaaatgcag agaaagatgc taaactcact ttggtcttaa 32100 
caaaatgtgg cagtcaaata cttgctacag tttcagtttt ggctgttaaa ggcagtttgg 32160 
ctccaatatc tggaacagtt caaagtgctc atcttattat aagatttgac gaaaatggag 32220 
tgctactaaa caattccttc ctggacccag aatattggaa ctttagaaat ggagatctta 32280 
ctgaaggcac agcctataca aacgctgttg gatttatgcc taacctatca gcttatccaa 32340 
aatctcacgg taaaactgcc aaaagtaaca ttgtcagtca agtttactta aacggagaca 32400 
aaactaaacc tgtaacacta accattacac taaacggtac acaggaaaca ggagacacaa 324 60 
ctccaagtgc atactctatg tcattttcat gggactggtc tggccacaac tacattaatg 32520 
aaatatttgc cacatcctct tacacttttt catacattgc ccaagaataa agaatcgttt 32580 
gtgttatgtt tcaacgtgtt tatttttcaa ttgcagaaaa tttcaagtca tttttcattc 32640 
agtagtatag ccccaccacc acatagctta tacagatcac cgtaccttaa tcaaactcac 32700 
agaaccctag tattcaacct gccacctccc tcccaacaca cagagtacac agtcctttct 32760 
ccccggctgg ccttaaaaag catcatatca tgggtaacag acatattctt aggtgttata 32820 
ttccacacgg tttcctgtcg agccaaacgc tcatcagtga tattaataaa ctccccgggc 32880 
agctcactta agttcatgtc gctgtccagc tgctgagcca caggctgctg tccaacttgc 3294 0 
ggttgcttaa cgggcggcga aggagaagtc cacgcctaca tgggggtaga gtcataatcg 33000 
tgcatcagga tagggcggtg gtgctgcagc agcgcgcgaa taaactgctg ccgccgccgc 33060 
tccgtcctgc aggaatacaa catggcagtg gtctcctcag cgatgattcg caccgcccgc 33120 
agcataaggc gccttgtcct ccgggcacag cagcgcaccc tgatctcact taaatcagca 33180 
cagtaactgc agcacagcac cacaatattg ttcaaaatcc cacagtgcaa ggcgctgtat 33240 
ccaaagctca tggcggggac cacagaaccc acgtggccat cataccacaa gcgcaggtag 33300 
attaagtggc gacccctcat aaacacgctg gacataaaca ttacctcttt tggcatgttg 33360 
taattcacca cctcccggta ccatataaac ctctgattaa acatggcgcc atccaccacc 33420 
atcctaaacc agctggccaa aacctgcccg ccggctatac actgcaggga accgggactg 33480 
gaacaatgac agtggagagc ccaggactcg taaccatgga tcatcatgct cgtcatgata 33540 
tcaatgttgg cacaacacag gcacacgtgc atacacttcc tcaggattac aagctcctcc 33600 
cgcgttagaa ccatatccca gggaacaacc cattcctgaa tcagcgtaaa tcccacactg 33660 
cagggaagac ctcgcacgta actcacgttg tgcattgtca aagtgttaca ttcgggcagc 33720 
agcggatgat cctccagtat ggtagcgcgg gtttctgtct caaaaggagg tagacgatcc 33780 
ctactgtacg gagtgcgccg agacaaccga gatcgtgttg gtcgtagtgt catgccaaat 3384 0 
ggaacgccgg acgtagtcat atttcctgaa gcaaaaccag gtgcgggcgt gacaaacaga 33900 
tctgcgtctc cggtctcgcc gcttagatcg ctctgtgtag tagttgtagt atatccactc 33960 
tctcaaagca tccaggcgcc ccctggcttc gggttctatg taaactcctt catgcgccgc 34 020 
tgccctgata acatccacca ccgcagaata agccacaccc agccaaccta cacattcgtt 34080 
ctgcgagtca cacacgggag gagcgggaag agctggaaga accatgtttt tttttttatt 34140 
ccaaaagatt atccaaaacc tcaaaatgaa gatctattaa gtgaacgcgc tcccctccgg 34200 
tggcgtggtc aaactctaca gccaaagaac agataatggc atttgtaaga tgttgcacaa 34260 
tggcttccaa aaggcaaacg gccctcacgt ccaagtggac gtaaaggcta aacccttcag 34320 
ggtgaatctc ctctataaac attccagcac cttcaaccat gcccaaataa ttctcatctc 34380 
gccaccttct caatatatct ctaagcaaat cccgaatatt aagtccggcc attgtaaaaa 34440 
tctgctccag agcgccctcc accttcagcc tcaagcagcg aatcatgatt gcaaaaattc 34500 
aggttcctca cagacctgta taagattcaa aagcggaaca ttaacaaaaa taccgcgatc 34560 
ccgtaggtcc cttcgcaggg ccagctgaac ataatcgtgc aggtctgcac ggaccagcgc 34 620 
ggccacttcc ccgccaggaa ccttgacaaa agaacccaca ctgattatga cacgcatact 34680 
cggagctatg ctaaccagcg tagccccgat gtaagctttg ttgcatgggc ggcgatataa 34740 
aatgcaaggt gctgctcaaa aaatcaggca aagcctcgcg caaaaaagaa agcacatcgt 34800 
agtcatgctc atgcagataa aggcaggtaa gctccggaac caccacagaa aaagacacca 34860 
tttttctctc aaacatgtct gcgggtttct gcataaacac aaaataaaat aacaaaaaaa 34 920 
catttaaaca ttagaagcct gtcttacaac aggaaaaaca acccttataa gcataagacg 34980 
gactacggcc atgccggcgt gaccgtaaaa aaactggtca ccgtgattaa aaagcaccac 3504 0 
cgacagctcc tcggtcatgt ccggagtcat aatgtaagac tcggtaaaca catcaggttg 35100 
attcatcggt cagtgctaaa aagcgaccga aatagcccgg gggaatacat acccgcaggc 35160 
gtagagacaa cattacagcc cccataggag gtataacaaa attaatagga gagaaaaaca 35220 
cataaacacc tgaaaaaccc tcctgcctag gcaaaatagc accctcccgc tccagaacaa 35280 
catacagcgc ttcacagcgg cagcctaaca gtcagcctta ccagtaaaaa agaaaaccta 35340 
ttaaaaaaac accactcgac acggcaccag ctcaatcagt cacagtgtaa aaaagggcca 35400 
agtgcgttac actgcagcag gtgtgactca gccatggcac ctctgcagcc tgggtaccct 354 60 
gcttggggca tggcccctta tagctgggcg gggcgtgggg gctctgtagg agtggcagcg 35520 
acctcagtgt ttgtctttgc tctgaagagc cctccaggtg cttgatccca ccttttccca 35580 
gcaggaacac tcctgcctgc cttaccacct gtcctggctg atggcctgtt cctgcctcct 3564 0 
ttgccccctg cccagactcc catgttcctg gacttgtggc ttcctccaac caggggctct 35700 
caagcctcca tacctggtcc cacctctcca ggccgtggga gggaggttga ggagggtgga 35760 
gggcatctgg ttgggggcag cctgggtgtt cccctcccat cccctccctg ggcctcccag 35820 
gccccctcta ctcttgagca atgctcttga gagcttcctg cctggctctt aacccagggc 35880 
aagccctgga agggcagacc caggacactc tcaccacctc cttacctttt cccctggaaa 35940 
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aatcttctgt atacttccca ttttaagaaa actacaattc ccaacacata caagttactc 36000 
cgccctaaaa cctacgtcac ccgccccgtt cccacgcccc gcgccacgtc acaaactcca 36060 
ccccctcatt atcatattgg cttcaatcca aaataaggta tattattgat gatg 36114 

<210> 17 
<211> 40 
<212> PRT 

<213> Adenovirus subgroup C 
<400> 17 

Met Thr Gly Ser Thr lie Ala Pro Thr Thr Asp Tyr Arg Asn Thr Thr 
15 io 15 

Ala Thr Gly Leu Thr Ser Ala Leu Asn Leu Pro Gin Val His Ala Phe 
20 25 30 

Val Asn Asp Trp Ala Ser Leu Asp 
35 40 



<210> 18 
<211> 19 
<212> PRT 

<213> Adenovirus subgroup C 
<400> 18 

Met Trp Trp Phe Ser lie Ala Leu Met Phe Val Cys Leu He He Met 
1 5 10-15 

Trp Leu He 



<210> 19 
<211> 8 
<212> PRT 

<213> Adenovirus subgroup C 
<400> 19 

Lys Arg Arg Arg Ala Arg Pro Pro 
1 5 



<210> 20 
<211> 42 
<212> PRT 

<213> Adenovirus subgroup C 
<400> 20 

Cys Cys Leu Lys Arg Arg Arg Ala Arg Pro Pro He Tyr Arg Pro He 

1 5 10 15 

He Val Leu Asn Pro His Asn Glu Lys He His Arg Leu Asp Gly Leu 
2° 25 30 

Lys Pro Cys Ser Leu Leu Leu Gin Tyr Asp 
35 40 
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